FIGURE l 



GGGGCT TCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTC TGAAGAC TAACAT T T TGTGAAGT T GTAAAACAGAAAACC TGT TAGAAATGT GG TGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACAT TACTGCAGTAACAC TCCACCATATAGACCCGGCTT TACCTTATATCAGTGACACTGG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
CTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAAGAGAACGTTATCATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGTT TT GCACAGTGGCAAT TT TGGGACT GATT TAGAACAGAAAC TCCAT TGGAACCCCGAGG 
ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
T GGAT T AAC CC TC TAT GACAC T GCACCT T GC CC TAT T AAC AATGAACGAACACGGC T AC T T T CC A 
GAGATAT TTGATGAAAGGATAAAATATT TCT GTAATGAT TAT GAT TC T CAGGGATTGGGGAAAGG 
T TCACAGAAGT TGCTTAT TCTTCTCTGAAATT TTCAACCACTTAATCAAGGCTGACAGTAACACT 
GATGAATGCTGATAATCAGGAAACATGAAAGAAGCCATT TGATAGAT TATTCTAAAGGATATCAT 
CAAGAAG AC TAT TAAAAACACC TATGCC T AT AC T T T T T TAT C TC AGAAAATAAAG TCAAAAGAC T 
ATG 
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FIGURE 2 



<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766/ pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LC I AT I YVRYKQVHALS PEENVI I KLNKAGLVLG I L S CLGLS I VANFQKTTL FAAHVS GAVLT FG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW * 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

Li 

Important features : 
y Type II transmembrane domain: 

a.!; s 

T r . amino acids 13-33 

jhj Other Transmembrane domains: 

*P amino acids 54-73, 94-113, 160-180, 122-141 

o 

N-myristoylation sites. 

F*i amino acids 57-63, 95-101, 99-105, 124-130, 183-189 

rii 
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FIGURE a 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CT TACACTT CGCC ATGA GT TT CCTCATCGACT CCAGCATCATGAT TACCTCCCAGATACTATT T TTT G 
GAT T T G G GT GGC T TT T CT T C AT GCG C CAATT G TT T AAAGAG TAT GAGATAC GT CAGT AT G T TGT ACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
i : CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
f*\ CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 
O TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 

SH CTCAGGAATGTGACT GACAC GGATATTCTAGC C CTGGAACGGCGACT GC TGCAAACCATGGATAT GAT 

hi 

w CATAAGCAAAAAGAAAAGGAT GGCAAT GGCACGGAGAACAATGT TCCAGAAGGGGGAAGT GCATAACA 
AACCATCAGGTT TCT GGGGAATGAT AAAAAGT GTTACCACT TCAGCAT CAGGAAGTGAAAATCT TACT 

ill 

%'i C TTAT TCAACAGGAAGT GGAT GCTT TGGAAGAATTAAGCAGGCAGCT T T TTCT GGAAAC AGCTGATCT 

, ATAT GCTAC CAAGGAGAGAAT AGAATACT CCAAAAC CTTCAAGGGGAAATATT TTAATT TTCTT GGTT 

Q ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 

I|! GGGAAAACGGATCCT GTCACAAGAGGC AT T GAGAT CACTGT GAAT TAT CTGGGAATCCAAT TTGATGT 

U GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT 

fs\ 

TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
CTAT TAGCACAGATAATGGGCAT GTACT T TGTCTCCTCT GT GCT GCTGATC C GAAT GAGT ATGCC TT T 

III 

AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACT GCAGT T CAAC TT C TAT CACCGT TGGT TTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCT TGAAC TTAAGCCTACTACAGACTGT TAGAGG CCAGTGGTTT CAAAATTTA 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 
ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTATGGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTG 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACT CT AGAGT C GAC C T G CAGAAG CT T G GC C GC CAT G GCC CAAC T T G TT TAT T G C AG C TTAT AAT G 
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FIGURE 4 

MS FL I DSS IMI T S QI LFFG FGWL FFMRQL FKDYE I RQ Y WQVI FS VT FAFS CTMFEL 1 1 FE I LGV 
LNS S SRYFHWKMNLCVI LL I LVFMVP FY I GYFI VSNIRLLHKQRLLFSCLLWLT FMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMMjLSGFGAWCPYTYMSYFLRNVTDTDILAXERRLLQ 
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVT TSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVS S VLLI RMSMPLE YRT 1 1 TEVLGELQFNFYHRWFDVI FLVSAL S S I L FL YLAHKQAPEKQMAP 

U Important features : 

2f Signal peptide: 

i.i amino acids 1-23 

ill Potential transmembrane domains: 

*F amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 

l H 425-444 

Q N-glycosylation sites. 

CIS amino acids 67-70, 180-183 and 243-246 

III 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
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FIGURE s 



AGCAGGGAAATCCGGAT GT CT CGGT TAT GAAGT GGAGCAG T GAGT GTGAGCCT CAACATAGT TCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAATTATGGGGATCACCCTTGTGAGCAAAAAGGCGAACCAGCAGCTGAATTTCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
ATTATCACCAC CAAAGAT CCCATAT TCAACAC TCAAACT GCAACACAAACAACAGAAT T TAT TGT 
, ; CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
H* C T GCTCC AGC T TC CAC TT C TAT TCCACGGAGAAAAAAAT T GAT T T GT GT CACAGAAGT T T T TAT G 

C| GAAAC TAGCACCAT G T CTACAGAAAC TGAACCAT T T GT T GAAAATAAAGCAGCAT TCAAGAAT GA 

p AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
ff% CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
f I CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA 
* TGAGGAAT CAAAGAAAACT GATAAAAACC CAGAAGAGT CCAAGAGTC CAAGCAAAAC TACCG TGC 

H GATGCCTGGAAGCTGAAGTT TAGA TGAGACAGAAATGAGGAGACACACCTGAGGCTGGTTTCTTT 
|sf CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
>* CCCT TGGT TCCTAACTGGAATCAGC TCAGGACTGCCATT GGAC TATGGAGT GCACCAAAGAGAAT 

GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
l a . T TCT AGCC TGGC TAT GTCC TAATAATATCCCAC T GGGAGAAAGGAGT T T T GCAAAGT GCAAGGAC 

D CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
|f| AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
m GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT 

GGCAGAAAAGT T T AGCCCC TGAAAGCCATGGAGATTCTCATAAC TTGAGACC TAATC TC T G TAAA 
GCTAAAATAAAGAAATAGAACAAGGCTGAGGATACGACAGTACACTGTCAGCAGGGACTGTAAAC 
CI ACAGACAGGGTCAAAGTGTTTTCTCTGAACACATTGAGTTGGAATCACTGTTTAGAACACACACA 
f|| CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTGTCTAGGAAATATACTTTTACAAGTAACA 
AAAAT AAAAAC TCT T AT AAAT T T C TAT T T T TAT C T GAG T T AC AG AAAT GAT T AC T AAGG AAG AT T 
AC T CAGTAATT TGT T TAAAAAGTAATAAAAT TCAACAAACAT T TGC TGAATAGC TAC TATAT GT C 
AAGT GCT GT GCAAGGTAT TACACTCT GTAAT TGAATATT AT T CC TCAAAAAATTGCACATAGTAG 
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
T T T TAT T T T TGC T GAGAC T AAT C T TAT T CAT T T TC TC TAATAT GGC AACCAT TATAAC C T T AAT T 
TATTATTAACATACCTAAGAAGTACATTGTTACCTCTATATACCAAAGCACATTTTAAAAGTGCC 
ATTAACAAATG TAT CACT AGCCCTCC TT T TT CCAACAAGAAGGGAC TGAGAGAT GCAGAAATATT 
T GT GACAAAAAAT TAAAGCAT T TAGAAAAC T T 
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FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LSLAGKDQVETALKASFETCSYGWVGDGFWISRISPNPKCGKNGVGVLIWKVPVSRQFAAYCYN 
SSDTWTNSCIPEI ITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKL I C VTEVFMET S TMS TE TE P FVENKAAFKNEAAGFGGVPTALLVLALL FFGAAAGLG FC YVK 
RYVKAFPFTNKNQQKEMIETKVA/KEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 



Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N- glycosylate on site. 

amino acids ■ 53.-57, 130-134, 289-293 



IK Casein kinase II phosphorylation site. 

y*V amino acids 145-149, 214-218 

%sl Tyrosine kinase phosphorylation site. . 

nr.. 

5 ' c? amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAATT GCGCAGCGCGGT GGAA.GAGATGGAGGCAGAAGAAGCT GC T GCTA 
AAGCATCATCAGAAGTGAACGTGGCAAACTTACCTCCCAGCTATCACAATGAGACCAACACAGAC 
ACGAAGG T T GGAAAT AATAC CATC CAT GT GCAC C GAGAAAT T C AC AAGAT AACC AACAACCAGAC 
TGGACAAATGGTCTTTTCAGAGACAGTTATCACATCTGTGGGAGACGAAGAAGGCAGAAGGAGCC 
ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
GAGATC C T GC T GCC CAGAGAGG TC C C C GAT GAG TAT G AAG T T GG CAGC T T CAT G GAGGAG G T G CG 
CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 
$ TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
O CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
If! CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 



w 



fn 



^■4 

fU AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
TACAGGT TAAC C TGCAGAAACAGTAC T T AGGTAAT T GTAGGGC GAGGAT TATAAATGAAAT T T GC 
AAAATCACTTAGCAGCAACTGAAGACAATTATCAACCACGTGGAGAAAATCAAACCGAGCAGGGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CAT GAT T GTATAAGCAT GCTT T C TT T GAG T T T TAAAT TAT GTATAAACATAAGT T GCAT T TAGAA 
ATCAAGCATAAATCACT T CAAC TGCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RS AVEEME AEEAAAKAS SEVNLANL P PS YHNE TNT DTKVGNNT IHVHRE I HKI TNNQTGQMVFS E 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGT I CDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDL I TWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDE-YEVGSEMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEEI • 

Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100/ 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N-myristoylation site. 

amino acids 202-208, 217-223 

Amidation site. 

amino acids 140-144 
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FIGURE Q 

CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT 
y, CAAG TGCTGC CTCTGGTGTCT GGAAAAAT T TAT CAAGT T C C T AAACC GCAATGCATACAT CATGA 

P TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
|f ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
Tl GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
%J ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
||| GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
4* GGAAGACCTGGAGGGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
!L AGAT TCT GGGCAAGAAGAACGAGGCGCC CCCGGACAACAAGAAGAGGAAGAAGTGACAGC T CCGG 

^ CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 

v.? s 

j s | CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 

11} TTGAGAGGGTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 

CI AAAC CTCCGTCTC TAT T AAAAAT AC AAAAAT T AGC C GAG AG T GG T G GCAT GCAC C T G T CAT C CC A 

ni 

l4# GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAA.CAAAACAAACAAACAAA 
AAGAT T T TAT TAAAGATAT T T TGT TAACT C 
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FIGURE 10 

RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWVLALGQCVLAGAFAS FYWAFHKPQDI PT FPL I S AFI RT LRYHTGS LAFGAL I LTLVQ I ARVI L 
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN 
IVRWVLDKVTDLLLFFGKLLWGGVGVLS FFFFSGRI PGLGKDFKS PHLNYYWLP IMTS I LGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features : 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 276-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Ami da ti on site. 

amino acids 306-310 



APP ID=1 0063734 



Page 164 



FIGURE 11 



ass, 
ii 5 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 



W ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 



- y 



ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCT.CCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GAC TTCGTGCCT TACT GAG T C T C TAAGAC T T T T TC T AAT AAAC AAGC C AGT GC GT GT AAAAAAAA 
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FIGURE 12 

MGAC LGAC S LL S CAS C LC G SAP C I LC S C CPAS RNS TVS RL I FT F FL FLGVLVS I IML S PGVE S QL 
YKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVS IAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
SFFHFCLVLASLHVMMTLTNWYKPGETRKMISTWTAVWWICASWAGLLLYLWTLVAPLLLRNRD 
FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 



CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGT T GGAAAAAGAC T CC T GTAACCC TCCT CCAGGATGAACCACCTGC CAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCC TATGAAGGAAGGGAAAAGAAAGGCATAT CT GATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
T GAATGG AGGC AT TGAG AACACAT T AGAG AAGGAGGT GAT GC AGTATGAC TAC TAT T C T T C ATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA 
CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG 
GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT 
GAGAAACCACTTTTAGAACTATGAGTACTACT TTTGT TAAAT GTGAAAAACCC TCACAGAAAGTC 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
CAC TGCTGGCTT TAT T GAACAGC T AAT AAAGAT T TAT T TAT T G T AAT AC CT CAC AAAC GT T GT AC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
AAAAGAC T TAATATAT T GAAGTAACACT T T TT TAGT AAGCAAGAT AC CT T T TTAT TTCAAT TCAC 
AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CTTGCAAAAGGGGAAGAAAGGAATTGCGAATACATGTAAAATGTCACCAGACATTTGTATTATTT 
TTATCATG7VAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGACTTAAACCATTCATATCATGTTTCCTTTGCGTTCAGCCAATTTCAATTAAAATGAA 
C TAAAT T AAAAA 
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FIGURE 1A 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWI I ELNVNGG I ENTLEKEVMQYDYYS S YFD I FLLAVFRFKVL I LAYAVCRLRHWWAIALT T 
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 
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FIGURE 15 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTG.GGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
, : CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
Q TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
O CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
ffl TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
fj TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
fi TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
HI CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCGAG 
III CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
>* TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
J CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
^ ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
O GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
If! GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
fl AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
|S GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
y GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCAGC 
fll CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 

GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCC'CTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCTGTGTGAC TCTAGT C T TGGCCCCAGGAAGC GAAGGAACAAAAGAAAC T GGAAAGGAAGATGC 
T TT AGGAACATGT T TTGCTTTTT TAAAATATATATAT T TATAAGAGAT CCT T TC CCAT TT AT TC T 
GGGAAGATGTT TT TCAAACTCAGAGACAAGGAC T T TGGT T TT T GTAAGACAAAC GAT GAT AT GAA 
GGCC T TT TGTAAGAAAAAATAAAAGATGAAGT GT GAAA 



APP ID- 



10063734 



Page 169 of 322 



FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 

MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHSITNETFRGLRRLERLY 

LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 

ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 

AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGPWVRESHVTLASPEE 

TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 

SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 

VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 

T VT QLRPNAT Y S VC VMP L G PGRVP E GEE AC GE AHT P PAVH SNHAP VT QARE GNL P L L I APAL AAV 

LIAAIAAVGAAYCVRRGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGS 

CEVPLMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites . 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGAT GAATCCT T AGAT TCCAAGACTACT T TGACATCAGATGAGT CAGTA 
AAGGAC CATACTAC TGCAGGCAGAG TAGT T GCT GGT CAAAT AT T T C T TG AT T C AGAAGAAT C T GA 
ATTAGAATCCTCTATTCAAGAAGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAG 
AAGATATCAGCT TT CTAGAGTCTCCAAATCCAGAAAACAAGGACTAT GAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
TTTCC TAGATAAGGAGTAT GATGAATGTACATCAGAT GGGAGGGAAGATGGCAGACTGTGGT GTG 
M C T AC AAC C TAT G AC T AC AAAG C AG AT GAAAAG TGGGGCTTTTGT GAAAC T G AAG AAG AG G C T G C T 

!f:! AAGAGACGGCAGATGCAGGAAGCAGAAAT GAT GTAT CAAACTGGAATGAAAATC C TTAATGGAAG 

ll I CAAT AAGAAAAGC C AAAAAAGAGAAGC AT ATC G G TAT C T C CAAAAGGCAG C AAGCAT GAAC CATA 

\j CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATCCAG 
W GC AGC GAGAGAGAT G T T T GAGAAGC T GAC T GAGGAAG GC T C T C C C AAG G G AC AGAC TGCTCTTGG 

^ CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
I,, TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTTTAGTGGAAGGCT 
Iff AATAATAT TAACAT CAGAAGAAT TT GTGG T T TATAGCGGC CACAACT T T T T CAGC TT TCAT GAT C 

Q CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT 

Pi AACACAT GGAAT C TACATGTAAATGAAAGT T GGTGGAGT CCACAAT T T TT CT TTAAAAT GAT TAG 

?-\ 

I::! TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 

III 

GAAT TGT CAGAATCATTTT TTACATTAGAT TATCATAAT TT TAAAAAT TT TTCT TTAGTTT TTCA 
AAAT T TTGTAAATGGTGGCTATAGAAAAACAACATGAAATATTATACAATAT TT TGCAACAATGC 
CCTAAGAATTGTTAAAATTCATGGAGTTATTTGTGCAGAATGACTCCAGAGAGCTCTACTTTCTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT T AAC T CAT T T T T AAT AAAAT TAT G T C T AAGAT T AAAAAAAAAAA&AAAAAAAAAAAAAAA 
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FIGURE 18 



mrvrigltlllcavllslasassdeegsqdesldskttltsdesvkdhttagrwagqifldseesel 
essiqeeedslksqegesvtedisflespnpenkdyeepkkvrkpaltaiegtahgepchfpflfldk 
eydectsdgredgrlwcattydykadekwgfceteeeaakrrq 

eayrylqkaasmnhtkalervsyallfgdylpqniqaaremfeklteegspkgqtalgflyasglg™ 
s sqakalvyytfgalggnliahmvlvsrl 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 2 67-271 

Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 



AATTCAGATTTTAAGCCCATTCTGCAGTGGAATTTCATGAACTAGCAAGAGGACACCATCTTCTT 

GTATTATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAA ATG CTCTTTTGGGTGCTAGG 

CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 

ATAAGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAACTTGGCAGCCAGAACTTTTGAT 

AAAAAGGGAT T TCATGT AATCGC TGCCT GT CT GAC T GAAT CAGGAT CAACAGC T T TAAAGGCAGA 

AAC C T C AG AG AGAC T T C G T AC TGTGCTTCTG GAT G T G AC C GAC C CAG AG AAT G T C AAG AG G AC T G • 

CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 

CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 

CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 

TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 

TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 

ATGCATT GAAC CAGGAT T GTT CAAAACAAACT TGGCA-GAT CCAGTAAAGG TAAT TGAAAAAAAAC 

TCGCCAT TTGGGAGCAGCT GTC TCCAGACATCAAACAACAATATGGAGAAGGT TACATTGAAAAA 

AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 

CATGGACCACGC.TC T AACAAGTC TC T T CCCT AAGAC T CAT TAT GCCGC T GGAAAAGAT GCCAAAA 

T TT TC TGGAT ACC TC TGT C T C ACAT GC C AGCAGC T T T GC AAGAC T T T T TAT T GT T GAAACAGAAA 

GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 

AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 

CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 

CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 

GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 

TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 

AT CAT C T C T T AT C T AAAT AT T AAAAGAT AAG T C AAC C C AAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAA 
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FIGURE 20 



MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAET SERLRTVLLDVTDPE3WKRTAQWVKNQVGEKGLWGL INNAGVPGVLAPTDWLTLEDY 
RE P I EVNL FGL I S VT LNML PL VKKAQGRV I NVS S VGGRL AI VGGG Y TPS KYAVEGFNDS LRRDMK 
AFGVHVS C I E PGL FKTNLADP VKVI EKKLAI WE QL S P D I KQQ YGE G Y I E KS LDKLKGNKS YVNMD 
LSPWECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 13 6-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCATTACTGATTCCCAAATGGATGATGTTGAAGTTGTTTATACAATTGA 
CAT TCAGAAATATAT TCCATGCTATCAGCT TT TTAGCTTT TATAATTCT TCAGGCGAAGTAAATG 
AGCAAGCACT.GAAGAAAAT AT TATCAAAT GTCAAAAAGAATG T GGTAGGT T GGT ACAAAT TCCG T 
CGTCAT TCAGATCAGATCATGACGT TTAGAGAGAGGCT GCT TCACAAAAACTTGCAGGAGCAT T T 
TTCAAACCAAGACCTTGTTTTTCTGCTATTAACACCAAGTATAAT7VACAGAAAGCTGCTCTACTC 
!S l. ATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTTTTTCACAGGGTACCTTTAGTGGTT 
J GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
3 T T T TAGCCGAGCAG TACAAACACACAGCT CTAAAT T T TT TGAAGAAGAT GGATCC TTAAAGGAGG 

3 J T ACAT AAG AT AAAT GAAAT G T AT GC T T CAT T AC AAG AG G AAT T AAAG AG T AT AT GC AAAAAAG T G 

.3..-! 

'] GAAGACAG T GAAC AAG C AG T AGAT AAAC T AG T AAAG GAT G T AAAC AGAT T AAAAC GAGAAAT T GA 

y GAAAAGGAGAGGAGCACAGAT T CAGGCAGCAAGAGAGAAGAACATC CAAAAAGACCC T CAGGAGA 

C ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
ATGTCTTTAAAAAATAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
;!f AGAC AAT C T G AC C T T AAT G G T AG AAC AC AC T G ACAT T C C T GAAG C TAG T C C AGC TAG T AC AC CAC 

AAAT CAT T AAGCATAAAGCCT TAGAC T TAGAT GACAGAT G GCAATT CAAGAGATC TC GGT TG T TA 
GATACACAAGACAAACGAT CTAAAGCAAATACT GG TAGT AGTAACCAAGAT AAAGCAT CCAAAAT 
GAGCAGC CCAGAAACAGAT GAAGAAAT T GAAAAGATGAAGGGT T TTGG T GAATAT TCACGGT CT C 
C TACAT T T TGA TCC T T T T AAC C T TACAAGGAGAT T T T TT TATT TGGCT GAT GGG TAA&GCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGTTTGCAGTAATACACAGATAACTCTTAGTGCATTTACTTCACAAAGTACTTTTTCAAACATCA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAATCCCAGCACT 
TAGGGAAGACAAGTCAGGAGGAT TGAT T GAAGC TAGGAGT T AGAGACCAGCCTGGGCAACGT AT T 
GAGACCAT GTC TAT TAAAAAATAAAAT GGAAAAGCAAGAATAGCCTTAT T T TCAAAATAT GGAAA 
GAAAT T TAT AT G AAAAT T T AT C T GAG T CAT T AAAAT T C T C C T T AAG T GAT AC T T T T T TAG AAG T A 
CAT TATGGC TAGAGT TGCCAGATAAAAT GCTGGATAT CAT GCAATAAAT T T GCAAAACAT CATC T 
AAAAT TTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 

MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFSFYNSSGEVNEQALKKILSNVKKNWGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
Q IQAAREKNI QKDPQENI FLCQALRT FFPNSE FLHS CVMS LKNRHVSKS S CNYNHHLDWDNLTL 
MVEHTDI PEAS PAS TPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEE IEKMKGFGEYSRSPT F 

Important features : 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites . 

amino acids 15-19, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



GGCACAGCCGCG C GGC GGAG GGCAGAGT CAGC CGAGC CGAGT C CAGC CGGAC GAG C GGAC CAG CGCAGGGCAGC C GAA 

GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 

GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 

GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTT.CTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 

CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCGTCTTCCACCAACTCCAACTCCTTCTCCC 

TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 

GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 

AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 

AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 

CAAGAGAT GGAGGAGAAGTACAGCCT GCAAAGT AAAGATGATT TCAAAAGT GT GGT CAGC GAACAGT GCAAT CAT T T G 

CAAGCT GT CTT T GCT T C ACGT TACAAGAAGT T T GAT GAATT C T T CAAAGAACT ACT TGAAAAT GCAGAGAAAT CCCT G 

AAT GAT AT GTT T GTGAAGAC ATAT GGC CAT T TATACAT GCAAAAT T CT GAGCTATT TAAAGAT CT CTT CGTAGAGT T G 

AAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTGGGCTCGCCTCCTGGAGCGGATG 

TTCCGCCTGGT GAACT C CCAGTACCAC T TTAC AGAT GAGT AT CT G GAAT GT GT GAGCAAGT ATAC GGAGCAGCT GAAG 

CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 

TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 

AT GAT CTACT GCT CCCACTGC CGGGGT CTC GT GACT GT GAAGC CATGTTAC AACT AC T GCT CAAACAT CAT GAGAGGC ■ 

TGTTTGGCCAACCAAGGGGATCTCGATTTTGAATGGAACAATTTCATAGATGCTATGCTGATGGTGGCAGAGAGGCTA 

GAGGGTCCTTTCAACATTGAATCGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGAT 

AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 

TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGGAACGCCCAACCACAGCAGCTGGCACT 

AGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAACTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAAC 

GTTTGCAACGATGAGAGGATGGCTGCAGGAAACGGCAATGAGGATGACTGTTGGAATGGGAAAGGCAAAAGCAGGTAC 

CT GTT T GCAGT GACAGGAAAT GGATTAGC CAAC CAGGGCAACAACC CAGAGGT CCAGGTT GACAC CAG CAAAC CAGAC 

ATACTGATCCTTCGTCAAATCATGGCTCTTCGAGTGATGACCAGCAAGATGAAGAATGCATACAATGGGAACGACGTG 

GACTT CT TTGATAT CAGT GAT GAAAGT AGT GGAGAAGGAAGT GGAAGT GGCT GTGAGTAT CAGCAGT GCCCTT CAGAG 

TT T GACT ACAATGCCACT GACCATGCT GGGAAGAGT GC CAAT GAGAAAGC C GACAGT GCT GGT GT CCGT C CTGGGGCA 

CAGGCCTACCTCCTCACTGTCTTCTGCATCTTGTTCCTGGTTATGCAGAGAGAGTGGAGATAATTCTCAAACTCTGAG 

AAAAAGT GT T CAT CAAAAAGT T AAAAGGCAC CAGTTAT CAC TTTT CTACC AT CCTAGT GACTTT GCTT TTTAAAT GAA 

T GGACAACAAT GT ACAGT TTT T ACTAT GTGGCCACT GGTT TAAGAAGT GCT GACTTT GTTT TCT CATT CAGTT TT GGG 

AGGAAAAGGGACTGTGCATTGAGTTGGTTCCTGCTCCCCCAAACCATGTTAAACGTGGCTAACAGTGTAGGTACAGAA 

CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 

TTTTTTTC CAACT GT GATCT CGC CTT GTTT CTTACAAGCAAAC CAGGGTCC CT T CT T GGCAC GTAACAT GTACGT ATT 

T CT GAAATATT AAATAGCT GTACAGAAGCAGGT TT TAT TTAT CAT GT TAT CT TATTAAAAGAAAAAGC C CAAAAAGC 
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FIGURE 24 



MARFGLPALLCTLAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 
CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSVVNPTAQCTHALLKMIYCSHCRGL 
VTVKPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMIMVAERLEGPFNIESVMDPIDVKISDAim 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSOTCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 
KPD I L I LRQ IMALRVMT S KMKNAYNGNDVD FFDISDESSGEGSGSGCEYQQCPSE FD YNAT DHAG 
KS ANEKADS AGVRPGAQAYLL TVFC I L FLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site, 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N -myr i s toy 1 a tion s i tes . 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



• if ft 



CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
C T GAGT CATC CCCAGGGAT CAGGAGCCT CCAGCAGGGAAC CT TCCAT TAT AT T C T T CAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCT T C TAGGAGATGGCT CCAGGAAGGCGGCCAAGAAT GT GAGT GCAAAGAT TGGT TC C TGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCAC CACAGAAAGC CAAACAAGCATT C CAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AG C TAG T G T CAT T T AAC C T T AAAT GCAAT C AG G AAAG TAG C AAAC AG AAG T C AAT AAAT AT T T T T 
AAATGTCAAAAAAAAAAAAAAAAAA 
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FIGURE 26 



MKVL I SS LLLLL PLMLMSMVS S S LNPGVARGHRDRGQASRRWLQEGGQECECKDW FLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 
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FIGURE 27 



GGACGCCAGC GCC T GCAGAGGC TGAGCAGGGAAAAAGCCAGT GCCCCAG CGGAAGCACAGC T CAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
GAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACCAGAGAGACCTGGAAGGATCTTGA 
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
TTCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA 
GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC 
AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGAC TCAAAGAGGCAGAGGT T T TGT T CT CAAATAT T TT T TAATAAATAGACGAA 
ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQLADGSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAMWQQVFEPTW 
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 

Important features : 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCT ATGC TGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTT GT T TAT GAT T C T T ATAC TAAT T T AT AC AAAGAT AT T AAGGCCCT G T T CAT T AAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAkAAAAAA 



fl 



ru 
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FIGURE SO 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 



Important features: 
Signal peptide: 

amino acids 1-15 



, , Growth factor and cytokines receptors family: 

#=! amino acids 3-18 

1 
Cfi 
|y 

W 
£3 

m 
o 

CO 

o 
ru 



APP_ID=1 0063734 



Page 184 of 322 



FIGURE si 

GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATC ATG TCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAAC'GACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC 
1^ AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
S TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
p| ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
U:i AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 

hi 

*2 TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
/ GATTCCGAGAATCATTGTCATGTACATGCAAAACGCACTGAAA.GAACAGCAGCATGGTGCATTGT 
Q CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
ill AACCAGAATGCATATACTACAACTGC TATTAATGGGACAGAT TTCTGTACATCAGCAAAA.GAT GC 

W ATTCAAAA.TCTTGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAA 
pi TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
f|J AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
T GGAAACAAAT GATGGATCGTCAGAAAAGCCC TAC T TT AT GGATCAAGAAT T T C TGAGTT T CGTA 
AAAAGGAG CAACAAAT TAAACAAT GCAAGGGCACAGCAGGACAAGCACT CATTAAGGAAT GAGGA 
GGGAACAGAACTCCAGGCCATTGTGAG ATAGA TACCCATTTAGGTATCTGTACCTGGAAAACATT 
TCC T TCTAAGAGCCATT TACAGAATAGAAGATGAGACCACTAGAGAAAAGT TAGTGAATT TTTT T 
T T AAAAG AC C T AAT AAAC C C TAT T C T T C C T C AAAA 
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FIGURE 32 

MSGRDTILGLCILALALSIJ^MFTFRFITTLLVHIFISLVILGLLFVCGVLWWLYYDYTNDLSIE 
LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
ILIFFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTCYFNRSKNDPPDHPILSSLSILFFYHQGTWKGSFLISWRIPRIIVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHS FLSVFETVLDALFLCFA 
VDLETNDGSSEKPYFMDQE FLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features: 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains : 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAAT TT G TATGGGAT TT GTGATGCAGGAAAGCC TAAGGGAAAAAGAAT AT TCAT TC T GT GT GGT 
GAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTT ATGA GGAC 
' TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
AT T CAAACAAAGAAACGGCAAAGAAGAT T AAAAGGC CCAAGT T CACT GT GC CT CAGAT CAAC TGC 
GATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 
TAT CGTC T TAGAAAGTAAACCCAAAAAGGG TGTAACCT AC CCAT CAGCTCT TACATAC T CAT CAT 
CGAAAAGT CCAGCT GCCCAAGCAGG TGAGACCACAAAAGC C TAT CAGAGGCCACC TAT T C CAGGG 
ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
^ CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
.Jjf GCCACAGGAGCCAGGAGATGGATCTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCC 
%3 AGAGC TGAT C CAGG TAT CCAAAGGCAAGAT CC T TCAGGAGCTGC CT TCCAGAAAC CT GT T GGAGC 

01 GGATGTCAGCCTGGGACTTGTTCCAAAAGAAGAATTGAGCACACAGTCTTTGGAGCCAGTATCCC 
yj TGGGAGAT CCAAAC T GCAAAATT GAC T TGT CGTT TT TAAT TGAT GGGAGCACCAGCAT T GGCAAA 

Ijl CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 
W ACAC GAAT T CTCGAGAT C TGAAGACAGCCATAGAGAAAAT T AC TC AG AG AG GAG GAC T T TC TAAT 

■M GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
~ GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
f«| GAC T T G C GAG AG AG T C AG GAAT CAAC AT T T T C T T CAT C AC CAT T GAAGG T G C T GC T GAAAAT GAG 

AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
Wi GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
0 ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
||j GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
S2jj AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 
2; TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
G T CC AAGCCCAACAAGAGGAAGT TAATGAT CC T CATCACCGAC GGGAGGT C C TACGACGACG TCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGC TAGAAG T CAT T GCCAC T CACCCCG CCAGAGACCACT C CTTCTTTG T GGACGA 
GTTTGACAACCTCCATCAGTATGTCCCCAGGATCATCCAGAACATTTGTACAGAGTTCAACTCAC 
AGCCTCGG7UVCTGAATTCAGAGCAGGCAGAGCACCAGCAAGTGCTGCTTTACTAACTGACGTGTT 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
T GAT CACAAACGTATAGAATGAGCCAAAAGGCTACAT CAT GT T GAGGGT GC TGGAGAT T T TACAT 
T T T GACAATT G T T T T CAAAATAAAT G TT CGGAATACAGT GCAGCCC T TACGACAGGC T TACGT AG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T T T T GT CATGACAATGTAGGAATT GC TGAAT TAAAT G TT TAGAAGGAT GAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAZVAARAAAAAAAZ^AAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 34 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 

C QDPKYHVYGT DVYAS Y S S VCGAAVHS G VLDNS GGK I LVRKVAGQ S G YKG SYS NGVQS L S L PRWR 

ES FI VLE SKPKKGVT YPSALT YSS SKS PAAQAGET TKAYQRPP I PGTTAQPVTLMQLLAVTVAVA 

TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 

VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 

GPAGPMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 

RSGAPOTVVVMVDGWPTDKVEEASRLARESGINIFFITIEGAAENEKQYVVEPNFANKAVC 

FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 

LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 

FKKSKPNKRKLMILITDGRSYDDVRIPAiyiAAHLKGVITYAIGVAWAAQEELEVIATHPARDHSFF 

VDE FDNLHQYVPR 1 1 QN I CTEFNSQPRN 

Important features: 
Signal peptide: 

amino acids 1-2 6 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites . 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571 r 577, 575-581, 627-633 

Ami da ti on site. 

amino acids 304-308 
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FIGURE as 



CCGAGC AC AG GAGATTGC CT GCGT TT AGG AGGT GG C TGCGT T GT GGGAAAAG CT AT C AAGGAAG AAAT TG C 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
ACAGAAAACAAC AAAAAACT T AAGCT T T AATT T CAT CT GGAATT CC ACAGT TTT CT TAGCTCCCT GGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CTCAGCCT TCCC CACT AC AATGT G AT AGAACGCGT G AAC T GGAT GT AC TT CT AT GAGT AT GAGCCGAT TT A 
CAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTTCTGGTCATTC 
H TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
CI TGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAGAGGCTGAAAAGGAAGACAAAATGTTGGC 
|;f AT T GTC CT T AGAGGAT GAAC AC CT TCT TT ATGGTG ACAT AAT CCGAC AAG ATT T T T T AGACACAT AT AAT A 

fl ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG 
J! AAG AC AGACACT GAT GTT TT CAT CAAT ACT GGCAATTT AGT G AAGT AT CT TT T AAACCT AAACC AC TC AGA 

s s ? GAAGT T TT T C AC AGGT TATCCTCT AATT GAT AATT ATT CCT AT AGAGGAT TT TACCAAAAAAC CCAT AT T T 

JZ CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 
* TT GGTGCCAAGGAT CT AT GAAATG AT GGGT C ACGT AAAACCC AT C AAGTT TGAAGAT GTT T AT GTC GGGAT 

Q CTGTTT GAATTT AT T AAAAGTG AACAT TC AT ATT CC AG AAGAC AC AAATC TTT T CT TTCT AT AT AGAATCC 

111 AT TT GGAT GT CT GT C AAC TGAGAC GT GTGATT GC AGCC CAT GGC TTT T CT TCC AAGGAGATC AT CACT TT T 

TGGC AGGT C ATGCT AAGG AAC AC C AC AT GC CATT AT TAACT T C AC AT T CT ACAAAAAGCC T AGAAGGAC AG 
GAT ACCTTGTGGAAAGT GTT AAAT AAAGT AGGT ACT GTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT 

1-1 

AC AC TG AACT GAAACT C ATGAAAAACCC AGAC T GGAGACTGG AG GGTT AC ACTT GT GAT T T ATT AG TC AGG 
CCCT TC AAAGAT GAT AT GTGGAGGAAT T AAAT AT AAAGG AAT TGGAGGTT T TTGCT AAAGAAAT T AAT AGG 
ACC AAAC AAT TT GGAC AT GT CAT T CT GT AG AC T AGAAT TT CT T AAAAGGGT GT T AC T G AGT T AT AAGCT C A 
CT AGGC T G T AAAAAC AAAAC AAT GTAGAGT TT T AT T TAT TGAAC AATGT AGTCACT T GAAGGTTT TGT GT A 
TATCTTATGTGGATTACCAATTTAAAAATATATGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATA 
CTGAACAAAATTTTACCTGTTTTTGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATT 
ATT AT T T AAAAT TACT T C AACTT T GT GT TT TT AAAT GT T TT G ACGAT T TC AAT AC AAG AT AAAAAG GAT AG 
TGAATC ATT C TT T AC ATGCAAAC ATT TT CC AGT TACTT AACT GAT CAGTT T AT T AT T GAT AC AT CACT CCA 
TT AATGTAAAGT CAT AGGTCATT ATT GC AT AT CAGT AAT CTCTT GGAC TT T GT T AAAT AT TT T AC T GT GGT 
AATATAGAGAAGAATTAAAGCAAGAAAATCTGAAAA 
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FIGURE 36 



MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYNVIERVNWMYFYEYEPIYRQDFHF 
TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 

Important features : 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites . 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 



n|>s Glycosaminoglycan attachment site. 

* _ amino acids 239-243 

jwj Ly-6 / u-PAR domain proteins. 

rp amino acids 23-37 



N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 




CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTCCT TGCCAAGAGAGT ACACAGTCATTAATGAAGCC TGCCC TGGAGCAGAGT GGAATATC AT GTGTCGGGAGT GCTGT G 
AATATGATC AGAT TGAGTGC GTC TGCC CCGGAAAGAGGGAAGTCGTGGGTTATACCATCCC T TGC TGCAGGAATGAGGAGAA 
TGAGTGTGAC T C C TGC C TGATCC ACCC AGGT T GTACC ATC T TTGAAAAC TGCAAGAGCTGC CGAAATGGCT C ATGGGGGGGT 
ACCT TGGATGAC T TCTATG T GAAGGGGT TCTAC TGTGCAGAGTGCCGAGCAGGC T GGTACGGAGGAGAC TGC ATGCGATGT G 
GCCAGGT TC TGC GAGC CCCAAAGGGTCAGAT T T TGTT GGAAAGC TATCCCC TAAAT GC TCAC TGTGAAT GGACCAT TCATGC 
TAAAC C TGGGT T TGT C ATCCAAG TAAGAT T TGTCATGTT GAGTC TGGAGTT T GAC TAC ATGTGCC AGTAT GACTAT G TTGAG 
GT TCGTGATGGAGAC AAC CG CGATGGCC AGAT CAT C AAGC GTGTC TGTGGCAACGAGCGGCCAGC TC C TATC CAGAGCATAG 
GATC C TCAC TC C ACGTCC T C T TCCACTC CGAT GGC TC CAAGAAT T TTGACGGT TT C CATGC CAT T TATGAGGAGAT CACAGC 
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
TATACTGGGCAGCGCTGTGAAAATCTCCTTGAAGAAAGAAACTGCTCAGACCCTGGGGGCCCAGTCAATGGGTACCAGAAAA 
TAAC AGGGGGC CC TGGGC T TATCAACGGACGC C AT GC TAAAATT GGCACCGTGGTGTC T T T C T TT TGT AAC AAC TC C TATGT 
TC TT AGTGGCAATGAGAAAAGAAC T TGCCAGC AGAAT GGAGAGT GGTCAGGGAAACAGCC CATCT GC AT AAAAGC C TGCCGA 
GAACC AAAGAT T TCAGACC T GGTGAGAAGGAGAGT TC T TC C GATGCAGGTT CAGTC AAGGGAGAC ACCATTACACC AGC TAT 
AC T C AGCGGC C T T C AGC AAGC AGAAAC TGCAG AGT G C C C C T AC C AAGAAGC C AGC CCTTCCCTTT GG AG AT C T GC C C ATGGG 
ATAC C AAC ATC T GCAT ACCC AGC TC CAGTATGAGT GC ATC TCACCCT T C TACC GC C GCC TGGGC AGCAGCAGGAGGACATGT 
C TGAGGACTGGGAAGT GGAGTGGGC GGGCACCATC C TGC ATCCC T AT C T GCGG GAAAAT TGAGAACATC AC T GCT C CAAAGA 
CCCAAGGGT TGC GC TGGCC GT GGCAGGCAGCCATC TACAGGAGGACCAGCGGGGT GC ATGACGGC AGCC TAC ACAAGGGAGC 
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 

GTC ACCATGAT C AAGACAGC AGACC TGAAAGT T GT TT TGGGGAAAT TC TAC C GGGATGATGACCGGGAT GAGAAGAC CATCC 
AGAGCCT AC AGAT T TC TGC T ATC AT TC T GCAT C CCAAC TATGACCCC ATCC T GCT T GATGC TGAC AT CGCC AT CC TGAAGC T 
CCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTCCAGGAG 
TCCCACATC AC TGTGGC TGGC TGGAAT GTCC T GGCAGACGTGAGGAGC CCT GGCTT CAAGAACGACAC AC TGC GC T CTGGGG 
TGGTCAGTGTGGTGGAC TCGC TGCTGTGTGAGGAGCAGCATGAGGACC ATGGCATC CC AGTGAGTGT C AC TGATAAC AT GT T 
CTGTGCCAGCTGGGAACCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 
CGAGCATC TCC T GAGC CACGC TGGC ATC TGAT GGGAC TGG TC AGC TGGAGC TATGATAAAACATGCAGCCACAGGCTCTCCA 
CTGCCTTCACCAAGGTGCTGCCTTTTAAAGAC TGGAT TGAAAGAAAT ATGAAATGAACCATGC TCATGCACTCCT TGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GT GC CAGGGCT TC TGAC T TC AGGGACAAAAC T C AGTGAAG GGTGAGT AGAC C TC CATTGC T GGTAGGC TGAT GCC GCGTCC A 
CTAC TAGGACAGCCAAT T GGAAGATGCC AGGGC TT GC AAGAAGTAAGTT TC T TCAAAGAAGACCATAT ACAAAAC C TCTCC A 
CTCCACTGACC TGGTGGTC TTCCCC AAC T T TC AGT T ATAC GAATGCC AT CAGC T T GACC AGGGAAGATC TGGGCT TCATGAG 
GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 
TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 
GAAC TAC AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII 
KRVCGNERPAPIQS IGS SLHVLFHSDGSKNFDGFHAI YEE I TACS S S PCFHDGTCVLDKAGS YKC 
ACLAGYT GQRCENLLEERNCSDPGGPVNGYQK I TGGPGL I NGRHAK I GT WS FFCNNS YVLS GNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVWAAHCVTDLGKVTMIKTADL 
KWLGKFYRDDDRDEKT I QS LQI S AI I LHPNYDP I LLDADI AI LKLLDKARI S TRVQP I CLAASR 
DL S T S FQE SH I TVAGWNVLADVRS PGFKNDTLRS GWS WDS LLCEEQHE DHG I PVS VT DNMFCA 
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature, 
amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100,- . 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



GGTT C CTACATC C T CTC AT CTGAGAAT CAGAGAGCATAAT CT T CTTAC GGGCC C GT GAT TTAT TAAC GT G GCT TAAT C 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGC 
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAG GT GCCT C CCACTAC GGCCT GAC CAAAGAT AGGAAGAGGC GCT CACAAGAT GGCT GT C CAGAC GGCT GTGCGAGC C 
TCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACCATCTCCTTAATGACAGACGAGCCTGGCCTAGACA 
ACCCT GCCTACGT GTCCT CG GCAGAGGAC GGG CAGC CAGCAAT CAGCCCAGT GGACTCTGGCCGGAGCAACCGAACTA 
GGGCACGGCCCTTTGAGAGATCCACTATTAGAAGCAGATCATTTAAAAAAATAAATCGAGCTTTGAGTGTTCTTCGAA 
GGACAAAGAGCGGGAGT GCAGTT GCCAAC CAT GC CGACC AGGGCAGGGAAAATT CTGAAAAC ACCACT GC CC CTGAAG 
m T CTT T CCAAGGTT GTAC CACCT GATT C CAGAT GGT GAAATTAC CAGCAT CAAGAT CAATC GAGTAGAT CCCAGT GAAA 

f*i GCCTCTCTATTAGGCTGGTGGGAGGTAGCGAAACCCCACTGGTCCATATCATTATCCAACACATTTATCGTGATGGGG 

•pat.? 1 

Q TGATCGCCAGAGACGGCCGGCTACTGCCAGGAGA(^TCATTCTAAAGGTCAACGGGATGGACATCAGCAATGTCCCTC 
ff| ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
' |jr| GCAGGAACAATGGACAGGCCCCGGATGCCTACAGACCCCGAGATGACAGCTTTC^ 
\! CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
h j . GT GTGGCATAT CGACATGGT CAGCTT GAGGAGAAT GACCGT GT GTTAGCCAT CAAT G GACAT GAT CT TCGATAT GGC A 

JZ GCC CAGAAAGTGC GGCT CAT C T GATT C AGGCCAGT GAAAGAC GTGTT CAC CT C GT C GT GT CC CGC CAGGTT C GGCAGC 

S GGAGCCCT GACAT CTTT CAG GAAGCCGGCT GGAACAGCAAT G GCAGCT GGTCC CC AGGGC CAGGGGAGAG GAGCAACA 

Ff CT C CCAAGCC CCTCC AT CCTAC AAT TACTT GT CAT GAGAAG GT GGTAAATAT CCAAAAAGACCCCGGTGAATCTCTCG 

y| GCAT GAC CGT CGCAGGGGGAGCATCAC ATAGAGAAT G GGATT T GC CT ATCT AT GT CAT CAGT GTT GAGCC C GGAGGAG 

Q TCATAAGCAGAGATGGAAGAATAAAAACAGGT GACAT T TT GT T GAAT GTGGAT GGGGT CGAACTGACAGAGGT CAGC C 

III' GGAGTGAGGCAGTGGC^TTATTGAAAAGAACATCATCCTCGATAGTACTCAAAGCTTTGGAAGTCAAAGAGTATGAGC 
Q CCCAGGAAGACT GCAG CAGC CCAGCAG CCCT GGACT CCAAC CACAACAT GG CC CCACC CAGT GACT GGTC CC CAT CCT 

f|j GGGT CAT GT GGCT GGAATTACCACGGT GCT T GTATAACT GTAAAGATATT GTATTACGAAGAAAC ACAGCT GGAAGT C 

TGGGCTT CT GCAT TGTAGGAGGT TAT GAAGAAT ACAAT GGAAACAAAC CTTT TTT CAT CAAAT CCATT GT TGAAGGAA 
CACCAGCATACAATGAT GGAAGAATTAGAT GT GGT GATAT T CTTCTT GCT GT CAAT GGTAGAAGTACAT C AGGAAT GA 
TACAT GCTTGCTT GG CAAGACT GCT GAAAGAACTTAAAGGAAGAATTACT CT AACTATTGTTT CTT GGC CT GGCACTT 
T T TT ATAGA AT CAAT GAT GGGT CAGAGGAAAACAGAAAAAT CACAAAT AG GCT AAGAAGTT GAAACACT ATAT TTAT C 
TTGTCAGTTTTTATATTTAAAGAAAGAATACATTGTAAAAATGTCAGGAAAAGTATGATCATCTAATGAAAGCCAGTT 
ACACCTCAGAAAATATGATTCCAAAAAA^TTAAAACTACTAGTTTTTTTTCAGTGTGGAGGATTTCTCATTACTCTAC 
AACATTGTTTATATTTTTTCTATTCAATAAAAAGCCCTAAAACAACTAAAATGATTGATTTGTATACCCCACTGAATT 
CAAGCT GATTTAAAT T T AAAAT TTG GTATAT GCT GAAGTCT GC CAAGGGTACATT AT GGC CATTTT TAATT T ACAGCT 
AAAATATT TTT TAAAAT GCATT GCT GAGAAACGTT GCT TT CAT CAAACAAGAATAAATATT TT TCAGAAGT TAAA 
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FIGURE 40 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAAT I SLMTDEPGLDNPAYVS SAE DGQPAI S P VDS GRSNRTRARP FERS T IRSRS FKKINR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHII IQHIYRDGVIARDGRLLPGDI ILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDS FHVI LNKS S PEEQLGIKLVRKVDEPGVFI FNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKVVNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myr i s toyl a ti on s i tes . 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 



ACC AGGC AT T GT AT C T T CAGT T G T CAT C AAGT T CGCAAT C AGAT T GGAAAAGCT C AAC T T GAAG C TT T 
CTTGCCTGCAGT GAAGCAGAGAGATAGAT ATTATT CACGTAATAAAAAACATGGGCT TCAACCT GACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CAC CAGT AACT AC T T C G TGGGTG CC AT T CAAGAGAT T CC T AAAGC AAAGGAG T T CAT G G C T AAT T TCC 
ATAAGACCCT CATTTTGGGGAAGGGAAAAACT CTGACTAAT GAAGCATC CACGAAGAAGGT AGAACT T 
GAC AAC TGTCCTTCTGTGTCTCCT T AC C T CAG AGG C C AGAGCAAG CT CATT T T C AAAC C AG AT C T CAC 
T TT GGAAGAGGTACAGGCAGAAAAT CCCAAAGT GTCCAGAGGC CGGT AT CGCCCTCAGGAATGTAAAG 
CTTT ACAGAGGGT CGCCAT CCTCGTTCCC CACCGGAACAGAGAGAAACACCT GAT GTACCT GCT GGAA 
CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
X AAAAAGT T T AAT CGAG CCAAAC T CT T GAAT G T GGG CTAT C TAGAAGC C CT CAAGGAAGAAAAT T GGG 
CI ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
CATC CCAAGCAT CTGGT GGT T GGCAGGAACAGCACT GGGT ACAGGTTACGT TACAGT GGAT ATT TTGG 
ft GGGT GT TACTGCCCTAAGCAGAGAGCAGT T TT T CAAGGT GAAT GGATT CTCTAACAACTACTGGGGAT 

g .1 G GGGAGGC GAAGACGAT GAC C T CAG AC T CAGG G TT GAGC T C CAAAGAAT GAAAA.T TTCCCGGCCCCTG 

JZ CCTGAAGTGGGTAAATATACAAT GGTCTT CCACACTAGAGACAAAGGCAATGAGGT GAACGCAGAACG 

s GATGAAGCTCTTACACCAAGTGTCACGAGTCTGGAGAACAGATGGGTTGAGTAGTTGTTCTTATAAAT 
CI TAG TAT C T GT GGAAC AC AAT C C T T TAT AT AT C AACAT CAC AGT GGAT TTCTGGTTTGGT GCATGACC C 

TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
TAGCAGAGCTCCTGGTGATGTAGAGTATAAAACAGTTGTAACAAGACAGCTTTCTTAGTCATTTTGAT 
CATGAGGGTTAAATATTGTAATATGGATACTTGAAGGACTTTATATAAAAGGATGACTCAAAGGATAA 
AATGAAC GCTAT T TGAGGACTCT GGT T GAAGGAGAT T TAT T TAAATT T GAAG TAAT ATAT T AT GGGAT 
AAAAGGCCACAGGAAATAAGACT GCTGAATGTCTGAGAGAACCAGAGTT GTTCTCGTCCAAGGTAGAA 
AGG T ACGAAGAT ACAAT AC T G TT AT T CAT T TAT C C T GTAC AAT CAT C T GT GAAGT GGT GG T GT CAGGT 
GAGAAGGCGTC CACAAAAGAGGG GAGAAAAGGCGACGAAT CAGGACACAGT GAAC T TGGGAAT GAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
T TT TAAAGAGT T T T T GT AAAAT GAT T T T G TACAAGT AGGATAT GAAT T AGCAG TT T ACAAG T T T ACAT 
ATTAACTAATAATAAATATGTCTATCAAATACCTCTGTAGTAAAATGTGAAAAAGCAAAA 



m 
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FIGURE 42 

MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQE I PKAKEFMANFHKTLILGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINITVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 4.3 

GCTCAAGACC CAGCAG T GGGACAGCCAGACAGACGGCAC GATGGCAC T GAGC TC CCAGAT CTGGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACT TGCAGAGCT GCAAC CCCAGGACAGAGCT GGAGCCAGGGC CAGC TGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAA&AAAAAAAAAA^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE dd 



^3 



in 



MM.SSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
C I FCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
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FIGURE as 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAAT ATGG CTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTTCAACACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCATAGTGACCCAAA 
ATCGTAATAGGGAGAGAGTAGACT T CCCAGAT GGAGGCTACTCCCT GAAGCTCAGCAAAC T GAAG 
AAGAAT GACT CAGGGATC TACTATGTGGGGATATACAGCT CAT CAC T CCAGCAGCCC TCCACCCA 
GGAGTACGTGC T GCAT GT C T ACGAGCAC CT GT CAAAGCC TAAAGTCACCAT GGGTC TGCAGAGCA 
ATAAGAAT GGCACC TGT GT GAC CAATCT GACATGC T GC AT GGAACATGGGGAAGAGGAT GT GAT T 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
Q C TGGAGAT GGGGAGAAAGT GATATGACC T T CATC TGC GT T GCCAGGAACCC T G T CAGCAGAAAC T 

|f] TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
W GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
, *l GTT TCTGAAGAGAGAGAGACAAGAAGAGTACAT TGAAGAGAAGAAGAGAGTGGACAT TTGTCGGG 

w 

k AAAC T CC TAACATAT GCCCCCAT TC T GGAGAGAACACAGAG TAC GACACAAT CCC TCACAC TAAT 

£ ' AGAAC AATCC T AAAGGAAGAT CCAGCAAATAC GGT T TAC T CCAC TGTGGAAATACCGAAAAAGAT 

Q GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 

tff TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 

Ill 
.SiK!:, 
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FIGURE 46 



MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKV1CQVDSIVWTFNTTPLVTIQP 
EGGT 1 1 VT QNRNRERVD FPDGGYS LKLSKLKKNDS G I YYVG I YS S S LQQPS TQE YVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFS S PI LARKLCEGAADDPDS SMVLLCLLLVPLLLSL FVLGLFLWFLKRERQEE YIE 
EKKRVDI CRET PNIC PHSGENTE YDT I PHTNRT ILKEDPANTVYSTVE I PKKMENPHSLLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG 
C TGCAACAACAGAACT GGAATGTT T C T T T CATCAT T T T T CAGTG TGATCACAGT CAT TGG T GCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAATTGTGAAT TTTCAT TGAAAAACATCAGTGACAT TCAT CCAGAATCCTTCAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
jM AC AC CAT GGC GAG T G G C T G GAGAGCAT C T AGT T T C C AC T T C GAT T C T GAAGAAAAC AAACAT AGG 

CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
IZ CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
s" g | T GTAG TTTAATGGGAATAAAATGTAAGTATCAGTAGTTTGAAAAAAAAAAA 

•2. 5 

W 

4= 

His 
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FIGURE 48 

MTCCEGWTSCNGFS LLVLLLLGWLNAI PL I VSLVEEDQ FSQNP I S C FE WWFPG 1 1 GAGLMAI PA 
TTMSLTARKFACCNNRTGMFLSSFFSVITVIGALYCMLISIQALLKGPLMCNSPSNSNANCEFSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVG I LE VLFGLS Q I VI GFLGCLCGVSKRRS Q IV 

Important features : 
Transmembrane domains : 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 



Q N-glycosylation sites. 

CO amino acids 80-84, 132-136, 148-152, 163-167 

w 

■„ : cAMP- and cGMP-dependent protein kinase phosphorylation site. 

P amino acids 223-227 

lj N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

o 

?*% Prokaryotic membrane lipoprotein lipid attachment site. 

ffj amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 4Q 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGC AGTGAGGAAC CT GCAGAGT C AC ACAGT T GC T GACCAAT TGAGC T GT GAGCC TGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CAT T TGCCAATAAAGACGAT CCCT TCTAC TAT GAC T GGAAAAAC CT GCAGC T GAGCGGACT GATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGC ACAGT C CTG TACC T GAGAAGGCCAT C CC AC T CAT CAC T CC AGGC T CT GCCA 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE so 

MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
SQKQHSPVPEKAIPLITPGSATTC 



Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE si 



HI 



GTGGAC T C TGAGAAGCCC AGGCAGT T GAGGACAGGAGAGAGAAGGC TGCAGACC CAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAG CAAGGGC GGCAAGG 
AGGAGACC CT GGTGGGAGGAAGACAC TC T GGAGAGAGAGGGGGC TGGGCAGAGATGAAGT T CCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
TTGGCAGACAGGCAGAAGATGTCAT TCGACACGGAGCAGATGC TGTCCGCGGCT CCTGGCAGGGG 
GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
yj GAAACTCAGCAGGCAGC TT TGGAATGAAT CCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAAT GGA 

S| GGGCCACCAAAC T T T GGGACCAACAC T CAGGGAGC T G TGGCCCAGCC TGGC TAT GGT T CAGT GAG 

*M AGCCAGCAACC AGAAT GAAGGG TGC ACGAATC C C CCACC AT C T GGC T CAGG TGGAGGCT C C AGC A 

*** ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
ifi TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 
0 CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
C T T CAGAGGACAGGGAGT T TCCAGCAACAT GAGGGAAAT AAGCAAAGAGGGCAATCGCCTCC TT G 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAATTTTAAATCCAAGCTGGGTTTCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGCTCTC 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
AAAAAAAAAA?UU ! iAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 52 



MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHEIGRQAEDVIRHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISBCEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 

Signal peptide: 

amino acids 1-21 

N-glycosylation site. 

amino acids 265-269 



Glycosaminoglycan attachment site. 

as amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 

■ if S 

*p* Casein kinase II phosphorylation site. 

2 amino acids 26-30, 109-113, 259-263, 300-304, 304-308 

N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE sa 



GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
H GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 
^ CCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAGCACATCAGCCTCATGACCTTGGACAGTCTA 

Ci 

CAGAAAT GCAT C T TCAGC T T T GACAGCCAT TGT CAGGAGAGGC CCAGT GAAT ATATTGCCACCAT 
l'l C T T GGAGC T CAG TGCCC T T GTAGAGAAAAGAAGCCAGCATATC C TCCAGCACATGGACTT TCTGT 

\| ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 

III GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 

r 

*r AGC CAAGT C CAAGAC T T T GGAT T TCAT T GATGTGC T T C T GC TGAGCAAGGATGAAGAT GGGAAGG 

J n CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACG 
If! GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
C3 ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 
SAJ AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTGCCTTCATC 
;!f! TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 

CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCAGCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE r& 

MSLLSLPWLGLRPVAMSPWLLLLLWGSWLIJ^IIiAWTYAFYNNCRRLQCFPQPPKRNWFWGHLG 
L I T PTEEGLKDS TQMSAT YSQGFTVWLGP 1 1 P FI VLCHPDT I RS I TNAS AAI APKDNL FI RFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAP FI SRCCTQD I VLPDGRVI PKGI T CL ID 1 1 GVHHNPTVWPDPEVYDPFRFDPENSKGRS P 
LAFIPFSAGPRNCIGQAFAMAEMKWLAIMLLHFRFLPDHTEPRRKLELIMRAEGGLWLRVEPLN 
VGLQ 

Important features : 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 

N-glycosylation sites. 

amino acids .112-116, 168-172 
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FIGURE 55 

ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAAT T CAT GGCCAGT T T TAT GAAGC T T TGGAAGGCACT AT GGACAGAAGC T GGTGG ACAGT T T T 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GT GAT T CGAACAT T T GAGGGT TAC T T T T GGAAGCAACAATAC AT TC T C GAACCT GAATGT CAGT A 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
TCTGGATGTTGTCCCACTGAATTCCCATGAATACAAACCTATTCAGCAACAGCAAAAAAAAAAAA 
AA?\AAAAAAAA?\AAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE *6 

MGPVKQLKRMFE PT RL I AT IMVLLC FAL TLCSAFWWHNKGLAL I FC I LQS LAL TWYS LS FI P FAR 
DAVKKC FAVCLA 

Important features : 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen -binding domain protein. 

amino acids 30-72 
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FIGURE 57 

CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCC^GCAGGCCGC(^CATGCTCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTTTCAAGCTCAGTGTCTTCATCCCCTCCCAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCT 
GGAGAT AAGGACCTT GAT GGG C AGCTAGACTTTGAAGAATTT GTCCATTATCTCCAAGAT CAT GAGAAGAAG CT GAGG 
CT GGT GT TT AAGAT TTT GGACAAAAAGAAT GAT GGACGC ATT GAC GCG CAGGAGATCAT GCAGT C CCT GCGG GACTT G 
GGAGT CAAGAT ATCT GAACAGCAGGCAGAAAAAAT T CTCAAGAG CAT G GATAAAAACGG CAC GAT GAC CAT C GACTGG 
AACGAGT GGAGAGACT ACCACCT CCT C CAC CC CGT GGAAAACAT CCCCGAGAT CAT CCT CTACT GGAAGCATTC CAC G 
AT CT TTGAT GT G GGT GAGAAT CTAACGGT CCCGGAT GAGT T CACAGT GGAGGAGAGGCAGACGGGGAT GT G GTGGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAG GT CCAT GCC TCC CGCAGCAACAACAT GGG CAT CGT T GGT GGCTT C ACTCAGAT GAT T CGAGAAGGAGGGGC CAGG 
T CACT CT GGC GGG GCAAT GGCAT CAACG T CCT CAAAAT T GCCC CCGAATCAGC CATCAAATT CAT GGCCT AT GAGCAG 
AT CAAGCGCCTT GT T GGTAGTGAC CAG GAGACTCT GAGGATT CAC GAGAGGCTT GT GGCAGGG.T C CTT GGCAGGGGC C 
AT C GC CCAGAGCAGCAT CT ACC CAATGGAG GT CCT GAAGAC CCG GAT GGCGCT GC GGAAGACAG GCCAGT ACT CAGGA 
AT GCT GGAC T GCGCCAGGAGGAT C CT GGCCAGAGAGGGGGTGG C CGCCT T CTACAAAGG C T AT GT CC CCAACAT G CT G 
GGCAT CAT CC CCTAT GC C GGCAT C GAC CTT GCAGT CTACGAGACGCT CAAGAAT GCCT GGCT GCAGCACTAT GCAGT G 
AACAGCGC GGAC CCC GGCGT GTT T GTGCT CCT G GCCT GT GG CACCAT GTC CAGTACCT GT GGCCAGCT GGCCAGCTAC 
CCC CT GG CC CTAGT CAGGAC CC GGAT GCAGGCGCAAG CCT CTATT GAGG GC GCT C CGGAGGT GAC CAT GAGCAGCCT C 
TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
GCTGTGAGCATCAGCTACGTGGTCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTGACGGGGGGAGGGC 
CGCCCGGCAGTGGACT CGCTGAT CCTGGGCCGCAGCCTGGGGT GTGCAGCCAT CTCATTCTGTGAATGTGCCAACACT 
AAGCT GT CT CGAG CC AAGCT GT GAAAAC CCTAGAC GCACCCGCAGGGAGGGT GGGGAGAGCT GGCAGGC CCAGGGC T T 
GT C CT GCT GACC C CAGCAGACCCTCCT GTT GGT T CC AGC GAAGACC ACAG GCATT CCTT AG GGTCCAGGGT CAGCAGG 
CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
AT AAT CCAT GAT GAAAG GT GAGGTCAC GT GG C CT CC CAGG C CT GACTT CC CAACCTACAGCATTGAC GCCAACTTGGC 
TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
GGC CT GGACC CT GTCAG GAT GG GCCC CACC TCAGAACCAAACT CACT GTC C CCACT GT GGCAT GAG GGCAGT GGAGCA 
CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
ACT GTT G GGAAAAGG GTT TT GT C CAGAAGGACAAGCC GGACAAAT GAGCGACT TCT GT GCTT CCAGAGGAAGACGAG G 
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
GGGACCAGCCCCACATTCCACTT GTGTCACT GCTTGGAACCTATTTATTTT GTATTTATTTGAACAGAGTTATGTCCT 
AACTAT TTTTATAGAT TT GTT T AATT AATAGCT TGT CATTT TCAAGT T CATT T TTTAT TC ATATTTAT GTT CAT GGTT 
GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAGGAAATCTTCAATAGGATGCAAAGATCAATGCAAAAATTGTTATATATGAACATATAACT GGAGTCGTCAAAAAG 
CAAATTAAGAAAGAAT TGGACGT TAGAAGT T GT CATTT AAAGCAG C CTT CT AATAAAGT T GTTT CAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 



FIGURE 58 



MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQE IMQSLRDLGVKISEQQAEKILKSMDKNG 
TMTIDWNEWRDYHLLHPVENIPEIILYWKHSTIFDVGENLTVPDEFTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVLMQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
ILAREGVAAFYKGWP^MLGIIPYAGIDLAVYETLKNAWLQHYAW 

GQLAS YPLALVRTRMQAQAS IEGAPEVTMS S L FKH I LRT E GAFGL Y RGLAPNFMKVI PAVS I S YV 
VYENLKITLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glyoosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE *Q 

GGA^GGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CAC T CAT CAT T GGC T T T GGTAT T T CAGG GAGACAC TC CATCACAGTCACTACTGTC GCCT CAGC T 
GGGAACATTGGGGAGGATGGAATCCTGAGCT GCAC TT TT GAACCTGACATCAAACTT TCTGATAT 
CG T GAT ACAATGGC T GAAGGAAGG T GT T T TAGGC T T GGT CCATGAGT T C AAAGAAGGCAAAGATG 
AGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTT 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAGCCTTCAGCATGCCGG 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
T CAACAACACATACT CC T GTAT GAT T GAAAATGACAT T GCCAAAGCAACAGGGGATATCAAAGT G 
ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 
GTGCC T T GGCCACAAAAAAGCAT GC AAAGT CAT T GT T ACAAC AGGGAT C T AC AGAAC TATT T CAC 
CACCAGATAT GACC TAGT TT TATAT T TC T GGGAGGAAAT GAAT T CATATCTAGAAGT CT GGAGTG 
AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 
C T AT CT T CAAAGACATAT T AGAAGT T GGGAAAATAAT TCAT GT GAAC TAGACAAGTG T GTTAAGA 
GTGATAAGTAAAATGCACGTGGAGACmGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCT T CCCAACT GACAAATGCCAAAG T TGAGAAA&AT GAT CATAAT TT TAGCATAAACAGAGCAGT 
CGGGGACACCGAT T T TATAAATAAACTGAGCACCT TC TT T T TAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 60 



MAS LGQ I L FW S 1 1 S 1 1 1 1 L AGAI AL 1 1 GFG I S GRH S I T VT T VAS AGN I GE DG I LS CT FE PD I KL S 
D I VIQWLKEGVLGLVHE FKEGKDELSE QDEMFRGRTAVFADQVI VGNAS LRLKNVQLTDAGT YKC 
YIITSKGKGNAKLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTWWASQVDQGANFSEVS 
NTSFELNSENVTMKWSVLYNVTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 

CVS S FFAI S WALLPLS PYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N- glycosylate on sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myr i s toyl a ti on s i te s . 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 



T GAC G T CAGAAT CACCATGG C CAGC T AT C C T T AC C GG CAG G G C T GC CCAGGAG C T GCAG GACAAG 

CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 

GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 

TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 

GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 

GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 

GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 

AT T GGT C T T C AT T CAAT GATGAGAC C T GCC T CATGAT GAT AAACAT GT T T GAC AAGAC CAAGT C A 

GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 

CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 

AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 

GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 

GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGC7UVCATCCGGCTCAGCTTCGAGGACTTCGTCA 

CCATGACAGCTTCTCGGATGCTATGACCCAACCATC TGTGGAGAGTGGAGTGCACCAGGGACCT T 

TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAG7\AC 

ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 

AATAGTGAGGACCGGGGCT GAGGCCACACAGATAGGGGCC T GAT GGAGGAGAGGATAGAAGT TGA 

ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 

AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 

GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 

CT GACCAAGCATGAGAGAGATC TGTC TAT GGGACCAGTGGC TTGGAT T C T GCCAC ACCCAT AAAT 

CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 

CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 

TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 

AT TT GGGGCCAAAAGT CCAG TGAAAT TGTAAGC T TCAATAAAAGGAT GAAACTC T GA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALWCNWSSFNDETCLMMIIN^ 

DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGN I RL S FEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-irryristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 



APP ID=1 0063734 



Page 216 



FIGURE 63 

GAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTGTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 
CAGC TGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATAT TAAT CGGTATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CGTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
TGCTCATCTCCAATAAAATAAAAGCACTTATGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^^ 
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FIGURE 64 

MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLJ^VPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6 s 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCGTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCN T 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 37-41 

N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 

AC GGAC CGAG GGT T CGAGGGAGGGACACGGACCAGGAACC T GAGC TAGGT CAAAGACGCC CGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGT GCCC T C TC TGGGCAGT CAGAT C CACCCAGT GCT TAATAGCAGGGAAGAAGGTAC TTCAA 
AGAC TCTGCCCCTGAGGTCAAGAGAGGATGGGGCTATTCACTT TTATATATT TATATAAAAT TAG 
TAGT GAGATGTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 

MANPGLGLLIx^GLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAI IWFS 
LLAALLIAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 



Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

f 85 ! amino acids 56-80 



PI N-glycosylation site. 

.. amino acids 36-40 

W 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

O amino acids 8 6-90 

ill 

Tyrosine kinase phosphorylation site. 

J«J amino acids 86-94 

111 

N-myristoylation sites. 

amino acids 7-13, 2 6-32 
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FIGURE 6Q 



tar 



O 
ru 



GC C AGGAATAACTAGAGAGGAACAATGGGGTTATT CAGAGGTTT T GT T T T C CT CTTAGTTCT GT GCCT GCTGCAC CAG 
TCAAATACT T CCT T CAT TAAGCT GAATAATAAT GGCT TT GAAGATAT T GT CATT GTTATAGAT CCTAGTGTGCCAGAA 
GAT GAAAAAAT AATT GAACAAATAGAGGAT AT GGT GACTACAGCTTCTAC GTACCTGTTT GAAGCCACAGAAAAAAGA 
TTTTTTTTCAAAAAT GTAT CTATATTAATTCCT GAGAAT T GGAAGGAAAAT CCT CAGTACAAAAG G C CAAAACAT GAA 
AAC CATAAACAT GCT GAT GT TATAGTT GCACCAC CTACACTC C CAGGT AGAGAT GAAC CATACACCAAGCAGT T CACA 
GAATGTGGAGAGAAAGGCGAATACATTCACTTCACCCCTGACCTTCTACTTGGAAAAAAACAAAATGAATATGGACCA 
CCAGGCAAACTGTTTGTCCATGAGTGGGCTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTC 
TACCGTGCTAAGT CAAAAAAAAT C GAAGCAACAAGGT GT T CC GCAGGTAT CT CT GGTAGAAAT AGAGT TTATAAGTGT 
CAAGGAGGCAGCTGT CTTAGT AGAG CAT GCAGAATT GATT CTACAACAAAACT GT ATGGAAAAGATT GT CAAT T C TTT 
CCTGATAAAGTAC AAACAGAAAAAGCAT C CAT AAT GT T T AT GCAAAGTATTGAT T CT GT T GT T GAATTTT GT AACGAA 
AAAACCCATAATCAAGAAGCTCCAAGCCTACAAAACATAAAGTGCAATTTTAGAAGTACATGGGAGGTGATTAGCAAT 
TCTGAGGATTTTAAAAACACCATACCCATGGTGACACCACCTCCTCCACCTGTCTTCTCATTGCTGAAGATCAGTCAA 
AGAATTGTGTGCTTAGTTCTTGATAAGTCTGGAAGCATGGGGGGTAAGGACCGCCTAAATCGAATGAATCAAGCAGCA 
AAACATTT CC TGCT GCAGACTGTT GAAAAT G GAT C CT GGGT GGGGAT GGTT CACTTT GATAGT ACT G CC ACTAT T GTA 
AATAAGCTAATC CAAATAAAAAGCAGT GAT GAAAGAAACACACT CAT GGCAGGAT TACCTACATAT C CT CT GGGAGGA 
ACTT CCAT CT GCT CT G GAATTAAAT AT GCATT T CAGGT GATT GGAGAGCTACATT C CCAACT CGAT GGAT C C GAAGTA 
CTGCT GCT GACT GAT GGGGAGGATAACACT GCAAGTT CTTGTATTGATGAAGT GAAACAAAGT GGG GCC AT T GTT CAT 



If! TTTATT GCT TTGGGAAGAGCT GCTGAT GAAGCAGTAAT AGAGAT GAG CAAGATAACAG GAGGAAGT C ATTTT T AT GT T 

y I TCAGAT GAAGCT CAGAACAAT GGCCTCATT GAT GCTTTTGGG GCT CT TACAT CAGGAAATACT GAT CT CT C CCAGAAG 

f'-l TCC CTT CAGCT C GAAAGTAAGGGATT AAC ACT GAATAGT AAT GCCT GGAT GAAC GACACT GT CAT AATT GATAGTACA 

Si gtgggaaaggacacgttctttctcatcacatggaacagtctgcctcccagtatttctctctgggatcccagtggaaca 
1 3 1 ataatggaaaatttcacagtggatgcaacttccaaaatggcctatctcagtattccaggaactgcaaaggtgggcact 
% 2l tgggcatacaatcttcaagccaaagcgaacccagaaacattaactattacagtaacttctcgagcagcaaattcttct 
gtgcctccaatcacagtgaatgctaaaatgaataaggacgtaaacagtttccccagcccaatgattgtttacgcagaa 
attctacaaggatatgtacct gttctt ggagccaatgtgact gctttcatt gaatcacagaatggacatacagaagtt 
J w ttggaacttttggataatggtgcaggcgct gattctttcaagaatgatggagt ctactccaggtattttacagcatat 

acagaaaatggcagatatagcttaaaagttcgggctcatggaggagcaaacactgccaggctaaaattacggcctcca ' 
: f ctgaatagagccgcgtacataccaggctgggtagtgaacggggaaattgaagcaaacccgccaagacctgaaattgat 
J gaggatactcagaccaccttggaggatttcagccgaacagcatccggaggtgcatttgtggtatcacaagtcccaagc 
J cttcccttgcctgaccaatacccaccaagtcaaatcacagaccttgatgccacagttcatgaggataagattattctt 
acatggacagcaccaggagataattttgatgttggaaaagttcaacgttatatcataagaataagtgcaagtattctt 
gatctaagagacagttttgatgatgctcttcaagtaaatactactgatctgtcaccaaaggaggccaactccaaggaa 
agctttg^tttaaaccagaaaatatctcagaagaaaatgcaacccac^tatttattgcc^ttaaaagtatagataaa 
agcaatttgacatcaaaagtatccaacattgcacaagtaactttgtttatccctcaagcaaatcctgatgacattgat 
cctacacctactcctactcctactcctactcctgataaaagtcataattctggagttaatatttctacgctggtattg 
tctgtgattgggtctgttgtaattgttaactttattttaagtaccaccatt tgaa ccttaacgaagaaaaaaatcttc 
aagtagacctagaagagagttttaaaaaacaaaacaatgtaagtaaagg 

gtgtgatcataaactcataaaaataattttaagatgtcggaaaaggatactttgattaaataaaaacactcatggata 
t gt aaaaact gt caagattaaaatttaatagtttcatt tattt gttattttattt gtaagaaatagt gat GAAC aaag 
atc ctttttcatactgatacctggttgtatattatttgat gcaacagttttctgaaatgatattt caaattgcatcaa 
gaaattaaaatcatctatctgagtagtcaaaatacaagtaaaggagagcaaataaacaacatttggaaaaaaaaaaaa 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa^ 
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FIGURE 70 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVS I L I PENWKENPQYKRPKHENHKHADVI VAPPTLPGRDE PYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVI IDSTVGKDTFFL 
I TWNS LPP SIS LWDP S GT IMEN FTVDAT S KMAYL S I PGTAKVGTWAYNLQAKANPE T LT I T VTS R 
AANS S VP P I T VNAKMNKDVNS FP S PM IVYAE I LQGYVPVLGANVTAFI E S QNGHTE VLELLDNGA 
GADSFKNDGVYSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWWNGEIEANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRY I IRI SAS I LDLRDS FDDALQVNT TDLS PKEANSKE S FAFKPENI S EENATHI FI AI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CT C CTTAGGT GGAAAC C CTGGGAGTAGAGTACT GACAGCAAAGACCG GGAAAGAC CATAC GT CCC CGG GCAG GGGT GA 
CAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAAT GAT GTAGCCACCT CCTAACCTT CCCTT CTT GAACC CC CAGTTAT G CCAGGATTTACT AGAGAGT GT C A 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GT GCT CAGAAGT GAC TT G CCT GAGGGT GGAC C AGAAGAAAGGAAAG GT CC C CT CTT GCTGT T GGCT G CAC AT CAG GAA 
GGCT GT GAT G GGAAT GAAGGT GAAAACT T GGAGATTT CACTTCAGT CATT GCTTCT GC CT GCAAGAT CAT CCT TTAAA 
AGTAGAGAAGCTGCT CT GTGT GGTGGTTAAC T CCAAGAGGCAGAACT C GTT CTAGAAG GAAAT GGAT GCAAGCAG CT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTG AATGA TGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 

C GCAACT ACGT GAGCAGCCT GAAGCGGCAGAT C GCACAGCT CAAGGAG GAGCT GCAGGAGAGGAGT GAGCAGCTCAGG 
AATGGGCAGTACCAAGCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTC 
CT GGC CTTCCTGCACT C GCAGGT GGAC AAGGCAGAG GT GAAT GCT GGC GT CAAGCT GGC CACAGAGTATGCAGCAGT G 
C CT TT CGATAGC T TTACT CTACAGAAGGTGT ACCAGCT GGAGAC T GGC CTTAC CC GCCAC CCC GAGGAGAAGC CT GT G 
AGGAAGGACAAGCGGGATGAGTTGGTGGAAGCCATTGAATCAGCCTTGGAGACCCTGAACAATCCTGCAGAGAACAGC 
CCCAATCACCGTCCTTACACGGCCTCTGATTTCATAGAAGGGATCTACCGAACAGAAAGGGACAAAGGGACATTGTAT 
GAGCT CAC CTT C AAAGGG GAC CACAAACACGAATT CAAAC GGCTCAT CTTAT T T C GAC CATTCAGCCC CAT CAT GAAA 
GTGAAAAATGAAAAGCTCAACATGGCCAACACGCTTATCAATGTTATCGTGCCTCTAGCAAAAAGGGTGGACAAGTTC 
C GGCAGTT CAT GC AGAATT T CAGGGAGAT GT GCAT TGAGCAGGAT G GGAGAGT CCATCTCACTGTTGTTTACTTTGGG 
AAAGAAGAAAT AAAT GAAGT CAAAGGAATACTT GAAAACACTT CCAAAGCT GC CAACT T CAGGAACTT TACCT T CAT C 
CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
TTTTTCTGTGATGTGGACATCTACTTCACATCTGAATTCCTCAATACGTGTAGGCTGAATACACAGCCAGGGAAGAAG 
GTATTTTAT CCAGT T CTT T T CAGT CAGTACAAT CCT GGCATAAT ATAC G GCCACCATGAT GCAGT CCCT CC C TTG GAA 
CAGCAGCT G GT CATAAAGAAGGAAACTGGATTT T GGAGAGACTTT GGAT T T GGGAT GACGT GT CAGTATC GGT CAGAC 
TTCAT CAATAT AGGT GGGTTT GAT CT GGACATCAAAGGCT GGGGC GGAGAGGAT GT GCAC CTT TAT CGCAAGTAT CT C 
CAC AG CAAC CT CATAGT GGTACGGAC GC C T GT GC GAGGACT CT T C CAC CT CT GGCATGAGAAG C GCTGCAT GGAC GAG 
CT GAC CCCC GAGC AGTACAAGAT GT GCAT GCAGT CCAAGGC CAT GAAC GAGGCAT CCC ACGGC CAGCT GGGCAT GCTG 
GT GT T CAG GCAC GAGATAGAGGCT CACC T T C GCAAACAGAAAC AGAAGACAAGTAGCAAAAAAACATGAACT CCCAGA 
GAAGGATT GT G GGAGACACT TTTT CTTT C CTTT T GCAATTACT GAAAGT GG CT GCAACAGAGAAAAGAC T T CCATAAA 
G GACGACAAAAGAATT GGACT GAT GGGT CAGAGAT GAGAAAGC C T C CGATTTCT CT CT GTT GGGCT TTT TACAACAGA 
AATCAAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTG 
AGATT AT AAG CCTAAT GGT GT GGAGGTT TT GAT GGT GT TTAC AATACACT GAGACCTGTT GTTTT GTGT GCT CATT GA 
AAT AT T CAT GATTTAAGAGC AGT TTT GTAAAAAAT T CATTAGCAT GAAAG GCAAGCATATT TCT CCTCAT AT GAAT GA 
GCCTAT CAGCAGGGCT CTAGTTT CTAGGAAT GCT AAAATAT CAGAAGGCAGGAGAGGAGATAGGCT TATT ATGAT ACT 
AGT GAGTACATTAAGTAAAATAAAATGGACCAGAAAAGAAAAGAAACCATAAATAT CGTGTCATATTTTCCCCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TT TT TT CC CT T GT GAGT T AT AGT CT GCT TAT TTAATTAC CACT TT GCAAGC CTTACAAGAGAGCACAAGTT GGC CT AC 
ATTTTTATAT TTTT TAAGAAGATACT TTGAGAT GCATT AT GAGAACTTT CAGTT C AAAGCATCAAATT GAT GCCAT AT 
CCAAG GACAT GCCAAATGCT GATTCT GT CAGGCACT GAAT GTCAGGCAT T GAGACATAGGGAAG GAAT GGT TT GTACT 
AATAC AGAC GTACAGATACTT T CT CT GAAGAGT AT TTT CGAAGAGGAGCAACT GAACACTGGAGGAAAAGAAAAT GAC 
ACT T T CT GCT TTACAGAAAAGGAAACTCAT T CAGACT GGT GATAT CGT GAT GTAC CTAAAAGTCAGAAACCACAT TTT 
CTC CT CAGAAGTAGGGACCGCTTT CT T AC CT GTT T AAATAAAC CAAAGTATAC C GT GT GAACCAAACAAT CTCT T TTC 
AAAACAGGGT GCT CCT CCT GGCTT CT GGCT TC CATAAGAAGAAAT GGAGAAAAAT ATAT AT ATATATAT AT ATATT GT 
GAAAGATCAATCCATCTGCCAGAATCTAGTGGGATGGAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAG 
TAACT GAAT TATT T TT TAAAT TAAGCAGT TCTACT CAAT CACCAAGAT GCTT CTGAAAAT T GCATT TTAT T AC CAT TT 
C AAACTATT TT TT AAAAAT AAAT ACAGTTAACAT AGAGT GGTTT C TTCATT CAT GT GAAAATTATT AGC CAGCACCAG 
AT GCAT GAGCTAAT TAT CT C TTT GAGTC CTT G C TT CTGTT T GCT CACAGT AAACT CATT GT TTAAAAGCTT CAAGAAC 
ATTCAAGCTGTTGGTGTGTTAAAAAATGCATTGTATTGATTTGTACTGGTAGTTTATGAAATTTAATTAAAACACAGG 
CCAT GAAT GGAAGGTGGTAT T GCACAGC TAATAAAATAT GATTT GT GGATATGAA 



FIGURE 72 

MMMVRRGLLAWISRVVVLLVLLCCAISVLYMLACTPKGDEEQLALPRANS 

EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENS PNHRP YT AS D F I E G I YRTERDKG T L YE LT FKGDHKHE FKRL I L FRP FS P IMKVKNEKLNMAN 
TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKEEINEVKGILENTSPCAANFRNF 
TFI QLNGE FSRGKGLDVGARFWKGSNVLL FFCDVDI YFT SE FLNTCRLNTQPGKKVFYPVL FSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNLIVTOTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHL 
RKQKQKTS SKKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites . 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 

GAGAC TGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGAT T T GTCCTGGGGAT CCA 
GAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGAT AAAT ACACT CACGCC AGG AGC TCGCTCGCTCTCTCTCTCTCTCTCT C AC T CCT C 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGC CCAG T CGCCCATCGAT AT T CAGACAGACAGT G T GACAT T T GACCCT GAT T T GC C 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGC T CCACCTGCAC T GGGGTCAGAAAGGATCCC CAGGGGGGTCAGAACACCAGAT CAACAG 
T GAAGCCACAT T T GCAGAGCT CCACATT GT ACAT TAT GACT CT GAT T CCT ATGACAGC T T GAGTG 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATATAGC TTATGAACACATTC TGAG T CACT T GCAT GAAGTCAGGCAT AAAGATCAGAAGACCTC 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
ATTTCAATGGAACAGCTGGAAAAGCTTCAGGGGACATTGTTCTCCACAGAAGAGGAGCCCTCTAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGTGGTCTTCACCTCAGCACAAGCCACGACTGAGGCATAAATTCCTTCTCAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACT G TAGGAGTAG TAAGCAGAT GT CC TCCT T CCCC TGGACAT C T C T TAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
C TCCT T AGGATAAAGAGT T GC T GT T GAAG T T GTATAT T T T T GAT CAATAT AT T T GGAAATT AAAG 
TTTCTGACTTT 
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FIGURE 7d 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFTSAQATTEA 



Important features of" the protein: 
Signal peptide: 

amino acids 1-15 



, : Transmembrane domain: 

f- amino acids 291-310 



0 N-glycosylation site. 

V? amino acids 213-216 



Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 7fi 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 

TCTGGTGGTTTGCC TAAACC T GCAAACAT CACC T T C T TATCC ATCAACAT GAAGAATGT C CTAC A 

AT GGACT CCACCAGAGGGTC TT CAAGGAG TTAAAGT T AC T TACACT GT GCAGTATT TCAT CACAA 

AT T GGCCCACCAGAGGTGGCACT GACTAC AGATGAGAAG T CC AT TTCTGTTGTCCT GACAGC TCC 

AGAGAAGT GGAAGAGAAATCCAGAAGACC TT CC T GT T T CCAT GCAACAAAT ATAC TCCAAT C TGA 

AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 

ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 

AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 

CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 

T C T G T GAT GGGC TAT T C CATCTACCGATATATCCACGTT GGCAAAGAGAAACACCCAGCAAAT T T 

GATT TT GATT TATGGAAATGAATT TGACAAAAGAT TC TT TGTGCCTGCTGAAAAAATCGTGAT TA 

AC T T TAT CACCC TCAATAT C TC GGAT GAT T CT AAAAT T TC T CAT CAGGAT AT GAGT T T AC TGGGA 

AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 

GGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAG 

7\AAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAGCAGAACAATACCCCCGGATAAA 

ACAGTCAT TGAATATGAATATGATGT CAGAACCACTGACATTT GTGCGGGGCCTGAAGAGCAGGA 

GCT CAGT T T GC AGGAGGAGGTG TCCACAC AAGGAAC AT TAT T GGAGTC GCAGGCAGCG T T GGCAG 

TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 

CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 

TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 

AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 

CCAGACAGGCCACCAGGAGAAAATGAAACCTATCTCATGCAATTCATGGAGGAATGGGGGTTATA 

TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 

TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 

CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 

GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 

AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 



FIGURE 76 

MSYNGLHQRVFKELKLLTLCS ISSQIGPPEVALTTDEKS I SWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQS SE FKAKI I FWYVLP I S I T VFLFSVMGYS I YRY I HVGKEKHPANL I L I YGNEFDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 



W Important features: 

2 Signal peptide: 

g B | amino acids 1-28 

Sj 

pj Transmembrane domain: 

+ n amino acids 140-163 



m 



N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGC T GAAGGACCACAAC GC CACCAGCATCC T GC AGCAGC TGCCGC T GC T CAGT GC CATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
H TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
^ CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
m TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
||| CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
\| CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
Wj TCCTGCCAT CAAGGGTGACACCAT TCAGCT CTACCTGGGGGCCAAGTTGT TGGACTCACAGGGAA 

J* AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
f»* TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
U| ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
Cl ■ ■ T GAT CAATGAAAAGGCTGCAGAT AAGCTGGGAT C TACCCAGAT CGTGAAGATCC TAAC TCAGGAC 

|y " ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 

TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 



ru 

T TTACACCAAAGGT GACCAACT TAT AC T CAAC T T GAAT AACATCAGC T CT GATCGGAT CCAGC T G 
ATGAAC T C T GGGAT TGGC TGGT TCCAACCT GAT GT T C T GAAAAACAT CATCAC TGAGATCAT CCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAG TGAA GACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACAC.TTG 
CCTGTGAAAAA 
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FIGURE 78 

MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGG I PVLGSLVNTVLKH 1 1 WLKVI TANI LQLQVKP SANDQELLVKI PLDMVAGFNT PLVKT I VE 
FHMTTEAQAT IRMDT SASGP TRLVLSDCAT SHGS LRI QLL YKLS FLVNALAKQVMNLLVPS LPNL 
VKNQLCPVIEAS FNGMYADLLQLVKVP ISLSI DRLE FDLL YPAIKGDT I QLYLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDVVKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQL I LNLNN I S S DRI QIjMNS G I GW FQPDVLKN I ITEIIHSI LLPNQNGKLRS GVPVSLVKAL G 
FEAAE S S L TKDALVLT PAS LWKPS S PVS Q 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminogly can attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 

GAGAGAAGTCAGCCTGGCAGAGAGACTCTGAAATGAGGGATTAGAGGTGTTCAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAA.TCGCTCCAACTACTACGATGCCTAC 
CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
GT CAGAAGG TGC T GC T GAGGAT AGAC T GAC T T T GGC CAT T GGAT T GAGCAAAGGCAGAAAT GGGG 
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTGCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAATT TGTGCATGAAC TGAAATAAAACCATCCTACGGTATGCAGGGAACAGAAAGCAGGATG 
CAGGAT GGGAGGACAGGAAGGCAGCCT GGGACAT T TAAAAAAAT A 



FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
IYSTLLGLPADIQAAQAMMVTSSAI SSIACI ISWGMRCTVFCQESRAKDRVAVAGGVFFILGGL 
LGFI PVAWNLHGI LRDFYS PLVPDSMKFE I GE AL YLG 1 1 S S LFS L I AG 1 1 LCFS CS S QRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 4 6-59 
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FIGURE 81 

CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGG ATG GTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
ACTGCCAGACTCCTGCTGCCAAGCCTGGAAAGATGAGGCAAGTGAGCAATCGGATGAAGAGGACA 
GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
CAGACCCAAGGGAGCAGGCAGCACAACTGTCAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCT 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGAC T T GGTGGAGATC TACC TC T GGAAGC TGGTAAAAGATGAGGAAACTGAGGC T CAGAGAG 
GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCTTCCACTTGACTCAGATCAAGAAAGTCAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACGT C T TCC TAGCCCAGACC CT GGAGC TGAAGGT CACGGCCAGT C CAGACAAAGT GAC CAAG 
AC AT AAC AAAGAC C TAAC AG T T G C AGAT AT G AGC T G T AT AAT T G T T G T TAT TAT AT AT T AAT AAA 
TAAGAAGTTGCATTACCCTCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLSPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKA.CVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 8s 

GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GAGGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CCAGTT CT CT AGCGTGGT CC AC CT CT AC AG GGACGGGAAGGACC AGCCAT TT AT GCAG AT GCCAC AGT AT C 
AAGGCAGGACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATTACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 

u 

: KS , AC AG GT GTCAGC ACT GGGCT C AGT T C CT CT C ATT T C CAT C ACGGGATATGTT GATAGAGACAT CCAGC T AC 

5=5 TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
ff= AC AG AC TCCAGGAC AAAC AG AGAC AT GCAT GGCC T GTT T G AT GT GG AG AT CT CTCT GACC GT CC AAGAGAA 

UJ CGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAG 
\| GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
||| TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 

GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 
AGACGGCTCACC CGAAGCTCTGCG T TTCTG AT CT G AAAAC TGTAAC CC AT AG AAAAGC TC CC C AGGAGGTG 
CCT C ACTCTGAG AAG AGAT T T ACAAGGAAG AGTGT GGT GGCT TC TC AGAGTT T CC AAGCAGGGAAACATT A 

m 

CTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGA 
^ . AGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCCTCAGACTGAATGGAGAACATTTGTATTTC 
jp% ' ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 

TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAGCAAAATGGAACTCCCATAGTCATCTGC 
CCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAGAGACAAGCAACAG 
TGAGTCCTCCTC ACAGGCAACCACGCCCTT CCTCCCCAGGGGTGAAAT GTAGGATGAATC ACATCCCACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CTGACATTACATTTAGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAG 
AACCGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAGTAGACAAGGAATGTGAATAATGCTTAG 
ATCTTATTGATGACAGAGTGTATCCTAATGGTTTGTTCATTATATTACACTTTCAGTAAAAAAA 



APP ID=1 0063734 



Page 237 of 322 



FIGURE 84 

MALMLS LVLS LLKLGS GQWQVFGPDKPVQALVGEDAAFSC FLS PKTNAEAME VRFFRGQFS SWH 
LYRDGKDQPFMQMPQYQGRTKLVKDS IAEGRI SLRLENI TVLDAGLYGCRI S SQS YYQKAIWELQ 
VSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
P I V I C PVT QE S E KE AS WQRAS AI PET SNS E S S S QAT T P FL PRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 8s 

AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAAT TGCAC CCT GAGCATCAGAGAT GCCAGAAGAAGTGATGCGGGGAGA 
TACTTCTTTCGTATGGAGAAAGGAAGTATAAAATGGAATTATAAACATCACCGGCTCTCTGTGAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
ACAAGACCGTCGATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGCGTGGGAGATAGGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGAGAAACTGCAGAGACTCACCCTGATTGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 
GAGGCTGAT TCT TGTAGAAT TAACAGC CC T CAACGT GATGAGCTATGATAACAC TATGAAT TAT G 
TGCAGAGT GAAAAGCACACAGGC TT TAGAG T CAAAGTATCTCAAACCT GAAT CCACAC T GT GCCC 
TCCC T T T TAT T T T TT TAAC TAAAAGACAGACAAAT T CCT A 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLS IRDARRSDAGRYFFRMEKGS IKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
S VLT L I PQPQDHGT S LTCQVT FPGAS VT TNKT VHLNVS YP PQNLTMT VFQGDGT VS T VLGNGS S L 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGS QQVY LNVS LQS KAT S GVT QGWGGAGAT ALVFL S FCVT FVWRS CRKKS ARPAAGVGDTGI E 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
!? YSEIKIHR 

fp Signal peptide: 

fj.f amino acids 1-15 

% JJ Transmembrane domain: 

xkjs 

amino acids 351-370 

tJ 
■i m 

Ul - 

o 

CO 
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FIGURE 87 



3 : 



AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGCTGTTTC 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAATGGACCTGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
N ? CAAC TACAACACC T T TGGATC T GCAGAGGCGGCCACGAGC GATGAC TACAAGAACCCT GGCTAC T 

Jjf ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
S AGAAACAGCTCCC TGCT GAGGTACCGCACGGACAC T GGC T TCC T CCAGACACTGGGACATAATC T 

yj GTTTGGCATCTACCAGAAATATCCAGTGAAATATGGAGAAGGAAAGTGTTGGACTGACAACGGCC 
\J CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
lM GGCCAGCGGGAATTCACTGCGGGATTTGTTCAGTTCAGGGTATTTAATAACGAGAGAGCAGCCAA 
CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
f*% GAT ACT TT CCAGAGGCCAGTCCC CAGCAG T GT GGAGAT TTTTCTGGTTTT GAT T GGAGTGGATAT 

If! GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
Q TTGAGAGTTTTGTGGGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAA 
W CAAC TT ACCCAGTAGC TAGAATG TT AAT GGCAGAAGAGAAAACAATAAATCATAT TGAC TCAAGA 

2T AAAAAA 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKSIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
S DDYKNPGYYD I QAKDLGI WHVPNKS PMQHWRNS S LLRYRTDT G FLQTLGHNLFG I YQKYPVKYG 
EGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

GlycosaminoglYcan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites . 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCGCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
H TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
5f TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
m GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
hj T C T T GAGT CTGGGAT TATCCGCAT T GTAT T TAG T GC T T TG TAATAAAATAT GTT T TGTAGTAACA 

% T TAAGACT TATAT ACAGT T T TAGGGGACAATTAAAAAAAAAAAA 

ill 



m 
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FIGURE qo 
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MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLISIGMCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE qi 



P 



CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATGCAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
AGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 
GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
T CT AC AT CGCC T TC TACAGCACGGAC TAT C GGCTGG TCCAGAAGGT GT GC CC AGAT TACAAC TAC 
CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 
AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCAGAGAGAGATGCTGGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CCTTGGTTCTTGCCATCCTGAGGAAAGATAGCAACAGGGAGGGGGAGATTTCATCAGTGTGGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCGAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNI S I SLVPPSKAVE FHQEQQI FIEAKASKI FNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites ♦ 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 

CGGTGGCC ATGA CTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
ATG T TCCGAT T TGCATAT TATAAAC TCT T AAAAAAAGCCAGT GAAGGT TT GAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
jU GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
O ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
;!!f TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
Z's TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
SJ AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
W AACC T CAGGGAACCAGCAC T T CC CAAACCGCAGAC TACAT C TT TAGAGGAAGCACAAC TGTGCCT 

# 5 TT TT CTGAAAAT CCC T T T T TCT GGT GGAAT TGAGAAAGAAATAAAAC TAT GCAGATA 



£3 

M 
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FIGURE Q4 

MTAAVFFGCAFIAFGPALALYVFTIAIEPLRI I FLIAGAFFWLVSLLI SSLVWFMARVI IDNKDG 
P TQKYLL I FGAFVSVYI QEMFRFAYYKLLKKASEGLKS I NPGETAPSMRLLAYVS GLG FGIMS GV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQT FI SS YYGINLASAFI ILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site, 

amino acids 120-123 

Sodium neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE Q5 



III 



AAT T T T T CACCAGAGT AAACT T GAGAAAC CAAC TGGACC T T GAG TAT T GTACAT TTTGCCTCGTG 
GACCCAAAGGTAGCAATC T GAAACATGAGGAGTACGAT TCTAC T GTT TT GT CT TC TAGGAT CAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACAC TACCAAACCAACAGCAGTCAAAT CAGG TC T TTCCTTCTT TAAGT CT GATAC CAT TAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACCCACCCATTGACCCTGGGAGGGTTGAATGTACAACAGCAACTGCACCCACATGTGTTAC 
CAATTTTTGTCACACAACTTGGAGCCCAGGGCACTATCCTAAGCTCAGAGGAATTGCCACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAAT CCAGAT G TCCAGGAT GGAAGCCT TC CAGCAGGAGGAGCAGGT GT AAAT CC T GCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 
GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
ATTGATTATATTATGGAATAGATTGAGACACATTGGATAGTCTTAGAAGAAATTAATTCTTAATT 
T AC C T GAAAAT AT T C T T GAAAT T T C AGAAAAT AT G T T C TAT G TAG AG AAT C C CAAC T T T T AAAAA 
C AAT AAT T C AAT GGATAAAT CTGTCTTT GAAAT AT AACAT TAT GCTGCCTG GAT G ATAT GC AT AT 
TAAAACATATT TGGAAAACTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE q6 

MRS T I LLFCLLGS TRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPS LSL I PLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
TDDDFAVTTPAGIQRSTHAIEEATTESANGIQ 

Signal peptide: 

amino acids 1-16 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATGCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
G GAT GAG C C AG G C AC AG C C AAC TCGGATCTTC GAT T C C AC AT C C T GAG C C AG G C T C C AG C C C AG C 
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
y CCAGGCC T CAGGCCACCAGGCCACTGCC ACCGT GGAAGT C TCCAT CATAGAGAGCACC T GGGTGT 

CI CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
|!l CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
Ul TGCAGAGGGAAACCTCTACGTGACCAGAGAGCTGGACAGAGAAGCCCAGGCTGAGTACCTGCTCC 
J't AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
y. ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
W CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
JH ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 

CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
?!°f TCGAAGT CGCAG TCACAGATAT CAAT GAT CAC GCCCC TGAG TT CATCACTT CCCAGAT TGGGCCT 

!?! ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
P CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 
fn GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 
35 GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
| s { AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
III ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 

CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA . 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
ATTTTCACCCACTGGACCATGTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE q8 

MVPAWLWLLCVSVPQALPKAQPASLSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVS I IESTWVSLEPIHLAENLKVLYPHHMAQ 
VHWSG,GDVHYHLE SHPPGP FEVNAEGNLYVTRELDREAQAE YLLQVRAQNSHGE DYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
P I SLPEDVE PGTLVAML TAI DADLE PAFRLMD FAI ERGDTEGT FGLDWE PDS GHVRLRLCKNL S Y 
EAAPSHEVWWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPISAPAGSFLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REHIIPWVSHNAQMWQLLVRVIVCRCNVEGQCMRKVGRMKGMPTKLSAVGILVGTLVAIGIFLI 
LI FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 
amino acids 1-18 

Transmembrane domain: 

amino acids 7 62-784 
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FIGURE qq 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGT CCAAGAT T CTT CCCAGGAACACAAACGTAGGAGACCCACGCTCC T GGAAGCACCAGCC T T TA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATC TAGGACCCAGGCATCT TGCT TT CCAGCCACAAAGAGACAG ATGA AGAT GCAGAAAGGAAAT G 
TTCTCCTTATGTTTGGTCTACTATTGCATTTAGAAGCTGCAACAAATTCCAATGAGACTAGCACC 
T CT GC CAACAC TGGATCCAGT GT GAT CTCCAGT GGAGCCAGCACAGC CACCAAC T CT GGGT C CAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGTCACCAACTCTGAGTTCCATACAACCTCCAGTGGGATCAGCACAGCCACCAACTCTGAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
GGCCAGCACAGCCAC CAAC TCT GAG T CCAGCACACCC TCCAGT GGGGCCAGCACAGT CACCAACT 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAG.CACAGTGTCC 
AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 
CAAC T CT GACT CCAGCACAACC TCCAG T GGGGC TAGCACAGCCACCAACT C T GAGTCCAGCACAA 
CCT CCAG T GGGGCCAGCACAGC CACCAAC TCT GAGTCCAGCACAGT GT CCAGTAGGGCCAGCAC T 
GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 
^ AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG 
T CCAGCACGACC TC CAGTGGGGC CAGCACAGCCACCAAC TC TGAGT C CAGCACGACC TCCAGTGG 
GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
CTGAGTC CAGCACAGT GT CCAG T GGGATCAGCACAGTCACCAAT T CT GAG T CCAGCACACC CTCC 
AGT GGGGCCAACACAGCCACCAACT C T GAG T CCAGT ACGACCT CCAGT GG GGCCAACACAGCCAC 
CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
GCCACCAACTC T GAC TCCAGCACAACCTCCAGT GAGGCCAGCACAGCCACCAAC TC TGAGT C TAG 
CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 
ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
G T GGAGT CC TAAC TGG T TC TGGAGGAGACCAG TATCATCGATAGCCAT GGAGAT GAGC GGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AAT CT TGAA.GAAGGTATTCCTCACCT TT CT TGCCT T TACCAGACACTGGAAAGAGAATACTATAT 
T GCTCAT T T AGC TAAGAAAT AAATACAT C T CAT C T AACACACACGACAAAGAGAAGC TG TGC T T G 
CCCCGGGGT GGG TAT C TAGC TCT GAGAT GAAC T CAGT TATAGGAGAAAACC TCCATGC T GGAC T C 
CATCTGGCATTCAAAATCTCCACAGTAAAATCCAAAGACCTCA7VAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 



MKMQKGNVLLMFGLLLHLEAATNSNE TS T S ANTGS S VI S S GAS T ATNSGS S VT S SGVS TAT I S GS 
SVTSNGVSIVTNSEFHTT.SSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGT ATMS ESS TVS S GI S T VTNSE S S T PS S GANTATNS E S S T T S S GANTATNS ESS TVS S GAS TA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 
AMEMSGRNSGP 

S i gna 1 pep ti de : 



%U amino acids 1-20 



Transmembrane domain: 

III amino acids 510-532 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAG.TGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AACGAGGCAGT T TCTAGC TACT GCACGT AC TT CATAAAGCAGGAC T C T AAAAGCT TTGGAAT CAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GAT T AT AAC T G GGGAT G CAT TTGTTCCTG G AG AAAGAAG T G T CAT TAT CAT G AAC C AT C GGAC AA 
GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 



5'""s 

m 



%! C T TCAT T CAT AGGAAATGGAAGGAT GACAAGAGCCAT TT CGAAGACATGAT T GAT TACT TT T GT G 

III AT AT T CACGAACCAC T TCAAC T CCT CATAT T CCCAGAAGGGAC T GAT C T CAC AGAAAACAGCAAG 



4- 



ru 



TCTCGAAGTAATGCAT TTGCTGAAAAAAATGGACTTCAGAAATAT GAATAT GTT T TACATCCAAG 
AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
C C C AGGG AAAT CCAC T T T C ACGT CCAC CGG TAT CCAATAGACACCC T C C CCACAT C C AAGGAGGA 
CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
GGG AGAAGAAT T T T TAT T T TACCGGAC AGAG TG TCAT TCC ACC T T GC AAG T C T GAAC TCAGGG T C 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 
GAATATT TGGTGGAC TGGAGAT CATAGAACTTGCATGTTACCGACTT TTACACAAACAGCCACAT 
T T AAAT T C AAAG AAAAAT GAG TAAG AT T AT AAGG T T T GC CAT G T G AAAAC C TAG AG CATAT T T T G 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCAT CATTAT TTGT TAAAGATATT T TGCACTTAATT TTGTGGGAAAAATAT TGCTACAAT TT TT 
T T T AAT C T C T GAAT G T AAT T T C GAT AC T GT GTACATAGCAGGGAG TGAT CGG GGT GAAA.TAAC T T 
GGGCCAGAATAT TAT T AAACAAT CAT CAGGC T T T TAAA 
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FIGURE 102 



MHSRGRE I WLLNPWS INEAVS S YCTYFI KQDSKS FGIMVSWKGI YFI LTL FWGS FFGS I FMLSP 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVI PPCKSELRVLWKLLS ILYWTLFSPAMCLLI YLYSLVKWYFI ITIVI FVLQERI FGG 
LE I I ELACYRLLHKQPHLNSKKNE 

■gf Important features of the protein: 

ffl Signal peptide: 

hi amino acids 1-22 

k. I 

'"si 

Transmembrane domains: 

'' Sp? amino acids 44-63, 90-108, 354-377 
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FIGURE 103 



CGGCTCGAGCGG'CTCGAGTGAAGAGCCTCTCCACGGCTCGTGCGCCTGAGACAGCTGGCCTGACC 
T CCAAAT CATC CATCCACCCC TGC T G TGAT CT GT T T TCATAGTGTGAGAT CAACC CACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCTCTACAGAGATGGGGAAGACTGGGAATCTAAGCAGATGCCACAGTATCGAGGGAGAACTGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC 
ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
AAAT CCAAAGGGAAAATC CAGGCGGAACT GGAC TGGAGAAGAAAGCAC GGACAGGCAGAAT TGAG 
AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
T T TCT GAT C T GAAAAC TGTAAC CCATAGAAAAGCTCC CCAGGAGGTGCCT CACT C T GAGAAGAGA 
TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 
C G T G G G ACAAAATG TAGGG TG G TAT G T GG GAG T G T G T C G GGAT GACG T AGAC AGGG G GAAGAAC A 
AT GT GAC T T T G TC T CCCAACAAT GGG TAT T GGGT CC T CAGACT GACAACAGAACAT T TGTAT TT C 
ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
TGACATGTCAGT T TGAAGGCT T G TT GAGACCC TAT AT CCAGCAT GCGATG TATGACGAGGAAAAG 
GGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGC 
CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT 
CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
T CAGAAGCCATGGCT GCCC TGAAGT GGGGACGGAATAGAC TCACATTAGGT T TAGTT TG TGAAAA 
CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAGT GAT T CC TGCC TCACAGGT GAAGAT T AAAGAGACAACGAATGTGAAT CATGC T TGCAGGT T 
TGAGGGCACAGTGTTTGCTAATGATGTGTTTTTATATTATACATTTTCCCACCATAAACTCTGTT 
TGCTTATTCCACATTAATTTACTTTTCTCTATACCAAATCACCCATGGAATAGTTATTGAACACC 
T GCTT TGTGAGGC TCAAAGAATAAAGAGGAGGTAGGAT TT TTCACTGATT CTATAAGCCCAGCAT 
T AC C T G AT ACCAAAACC AGGC AAAGAAAAC AGAAGAAGAGGAAGGAAAAC T AC AGGT CC AT AT CC 
C T CAT TAAC AC AGAC AC AAAAAT T C T AAAT AAAAT T T TAACAAAT T AAACT AAAC AATATAT T T A 
AAGAT GATATATAAC TAC TCAG T GT GGT T T GT CC CACAAATGCAGAGT TGG T T TAATAT T T AAAT 
ATCAACCAGTGTAATTCAGCACATTAATAAAGTAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
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FIGURE 104 

MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHAVVH 
LYRDGEDWES KQMP Q YRGRTE FVKDS I AGGRVS LRLKNI T P S D I GLYGCW FS S Q I YDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
IVQENAGSILCSIHLAEQSHEVESKYLIGETFFQPSPWRIASILLGLLCGALCGVWGMIIVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQGFQAGRHYWEVDVGQOTGWWGVCRDDVDRGKNNVTLSPNNGYWVLRLTTEHLYFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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FIGURE log 



CC TT C ACAGGAC TCT T CAT T GCTGGT T G GC AATGAT GT ATCGGCCAGATGTGGT GAGGGC T AGG AAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CTCACTGTTCATTATGTGAGATATAATCAAAAGAAGACCTACAATTACTATAGCACATTGTCATTTACAAC 
TG AC AAAC T ATATGCT GAGT TT GGCAGAGAGGCTTC TAACAATT T T AC AG AAAT GAGCC AGAGACT T GAAT 
CAATGGTG AAAAAT GC AT T T TATAAATCT C CATT AAGG GAAGAAT TTGTC AAGT CT CAGGTT AT CAAGTT C 
AGTCAACAGAAGCATGGAGTGTTGGCTCATATGCTGTTGATTTGTAGATTTCACTCTACTGAGGATCCTGA 
AACTGTAGATAAAATTGTTCAACTTGTTTTACATGAAAAGCTGCAAGATGCTGTAGGACCCCCTAAAGTAG 
ATCCTC ACT C AG TT AAAATT AAAAAAAT C AAC AAGAC AGAAAC AGAC AGC TAT C T AAACC AT T G CT GC GG A 
AC ACGAAG AAGT AAAACTCT AGGT CAGAGT CT C AGGAT CGTT GGTGGG AC AGAAGT AG AAGAGGGT GAAT G 
GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
TTGTGAGTGCTGCTCACTGTTTTACAACATATAAGAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACA 
||| AT AAAACCT TCG AAAATGAAACGG GGTCTC CGGAGAAT AAT T GT CCAT GAAAAAT ACAAACACCCATC AC A 

\j TGACTATGATATTTCTCTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTC 
k| TCCCTGATGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAAAT 
4« GATGGT TACAGT C AAAAT CATCTT C G ACAAGC AC AGGT GACT CT CAT AGACGCT ACAACT TG CAAT GAACC 

s T C AAGC TT AC AAT G ACGCCAT AAC T CCT AGAATGT T AT GTGCTG GCTCCT T AGAAGGAAAAACAGATGC AT 

^ GCC AGGGT GACT CT GG AG GACC AC T G GT T AGT T C AG AT GC T AGAGAT ATC TGGT ACCTT GCT GGAATAGTG 

If! 

AGCTGGGGAGATGAATGTGCGAAACCCAACAAGCCTGGTGTTTATACTAGAGTTACGGCCTTGCGGGACTG 
J? GATTACTTCAAAAACTGGTAT.CTAAGAGACAAAAGCCTCATGGAACAGATAACATTTTTTTTTGTTTTTTG 
14 GGTGTGGAGGCC ATT T TT AG AG AT AC AG AATT GG AG AAGACT T GC AAAAC AGCT AG ATT T GAC T GATC T C A . 

jf|| ' ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 

GATC AACTCT GT CAT C TGTG AGCAAT AG T T GAAACT TT AT GT AC AT AGAGAAAT AG ATAAT AC AAT AT T AC 
ATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGACTTGTTGACATAAATTTGTAAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
T C AAGGT GCAGAACAAGG AGTGAAAG AAAAT AT AAG AAGAAAAAAATCCC CT AC AT T TT ATT GGC ACAGAA 
AAGT AT T AGGTGTTT T TC TT AGTGGAAT AT T AGAAATGAT CAT AT T CATT ATGAAAGGT C AAGC AAAGAC A 
GC AGAATAC C AATC AC TT CATC AT T T AG G AAGT AT GGGAACT AAGT T AAGGAAGTCCAG AAAGAAGCC AAG 
ATATATCCTTATTTTCATTTCCAAACAACTACTATGATAAATGTGAAGAAGATTCTGTTTTTTTGTGACCT 
AT AAT AATT AT ACAAACT TC AT GC AAT GT ACT T GT T CT AAGC AAATT AAAGC AAAT AT TT AT T T AACATTG 
TT AC TG AGGATGTC AAC AT AT AAC AAT AAAAT AT AAAT CACCC A 
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FIGURE 106 

MMYRPDVVRARKRVCWE PWVT GLVI FI SL I VLAVC I GLTVHYVRYNQKKT YNY YS TLS FT TDKLY 
AEFGREASNNFTEMSQRLESMVKNAFYKSPLREEFVKSQVIKFSQQKHGVLAHMLLICRFHSTED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYE FQPGDVMFVTGFGALKNDGYS 
QNHLRQAQYT L I DAT T CNE P QAYNDAI T PRML CAG S L E GK T DAC Q GD S GGP L VS S DARD I W Y L AG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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FIGURE 107 



AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGTT CTCACTTTAACTGCAGAAACGAGGGTAGAGGAAGCAGTCAT TT TGAC TTAC TTTCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
G T GT AGAAC TGGCT T GTGGC GT T T GGACATAT GAACAGGAACT TATGG T TCCAGTACAATGGTCA 
GATAT GGT CAC T TTGAAAGC CAGGAT GACAAAT TATGGAT TACCTAGAT AT C GGT GGC T TACTCA 
T GCT T GGAAT T T TT T T CAGAGAGAG T T TAAGTGC TG T GGAGTAGTAT AT T T CAC T GAC TGGT TGG 
U\ AAATGACAGAGATGGACTGGCCCCCAGATTCCTGCTGTGTTAGAGAATTCCCAGGATGTTCCAAA 

CAGGCCCACCAGGAAGATC T CAGTGACCT T TAT CAAGAGGGTT G T GGGAAGAAAATGTATT CC T T 
y T TTGAGAGGAACCAAACAACTGCAGGT GCTGAGGTTTCTGGGAAT CTCCATT GGGGTGACACAAA 

111 TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
||| ACAGACCAAATGATGTCCTTGAAGAATGACAACTCTCAGCACCTGTCATGTCCCTCAGTAGAACT 

G T T G AAAC C AAG C C T G T C AAG AAT C T T T G AAC AC AC AT C CAT G G C AAAC AGC T T T AAT AC AC AC T 
/I T TGAGATGGAGGAGT TATAAAAAGAAATGT CACAGAAGAAAACCACAAACTTGT T TTATTGGACT 

m T GTGAAT T T TT GAGTACATACTATGT GT T T CAGAAATATG TAGAAATAAAAAT G T T GCCATAAAA 

4* TAACACCTAAGCATATACTATTCTAT GCTT TAAAATGAGGATGGAAAAGTT TCATGTCATAAGTC 

s ' ACCACCTGGACAATAATTGATGCCCTTAAAATGCTGAAGACAGATGTCATACCCACTGTGTAGCC 
f*| T GT G TAT GAC T T T TAC T GAACACAG T TAT G T T T T GAGGC AGC AT GGT T T GAT T AGC AT T T C C G C A 

?p T CCAT GCAAACGAGT CACATAT GGTGGGACTGGAGCCATAGT AAAGG T T GAT T TAC T TC TACCAA 

Wj C T AGTATAT AAAGTAC TAAT TAAAT GC T AAC AT AGGAAGT TAGAAAATACT AATAAC T T T TAT T A 

CI CT CAGCGAT CT AT TC T T C T GAT GC TAAAT AAAT TAT ATAT CAGAAAACT TT CAATAT TGGT GAC T 

|I| AC C TAAAT G T GAT TTTTGCTGGT T AC T AAAAT AT T C T TAC CAC T T AAAAG AG C AAGC T AAC AC AT 

f*} TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
Jj{ T T CAG T T C T GAT AAT G T T AAG AAT AAC CAT TAT GAAAAGG AAAAT TTGTCCTG TAT AGC AT CAT T 

^ ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 

TGTTATTTAAATACTTAACCACTAATTTTGAAAATTACCAGTGTGATACATAGGAATCATTATTC 
. AGAATGTAGTCTGGTCTT TAGGAAGTATTAATAAGAAAAT T TGCACATAACT TAGT TGATTCAGA 
AAGGACT T G T AT GC T GT T T T TCT CCCAAAT GAAGACT C T T T TT GACACTAAACAC T TT T TAAAAA 
GCTTATCTTTGCCTTCTCCAAACAAGAAGCAATAGTCTCCAAGTCAATATAAATTCTACAGAAAA 
TAGTGTTCTTTTTCTCCAGAAAAATGCTTGTGAGAATCATTAAAACATGTGACAATTTAGAGATT 
CTTTGTTT TAT T T CAC TGAT TAAT AT AC T G T GGCAAAT T AC AC AGAT TAT TAAAT T T T T T T AC AA 
GAG TAT AGTATAT T TATT TGAAATGGGAAAAGT GCAT T T TACT GT AT T TT GT GT AT T T TG T T TAT 
T TCTCAGAATATGGAAAGAAAAT TAAAATGTGTCAATAAATAT T TTCTAGAGAGTAA ' 
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FIGURE 108 

MAREDSVKCLRCLLYALNLLFW 

VMIAVCCFLI I VGMLGYCGT VKRNLLLLAW Y FGS LLVI FCVELACGWTYEQELMVPVQWSDMVT 
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLS DLYQEGCGKKMYS FLRGTKQLQVLRFLG I S IGVTQI LAMILT I TLLWAL YYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 



O 

m 



fill 
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FIGURE 1QQ 



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GATAAAGTCTTTCTCGTGAGAGCCTAGAGGCCTTAAAAAAAAAAGTGCTTGAAAGAGAAGGGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCT CTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCT AGATCCTT GTTCT GCTTACATCAGCCTGAATGAGC 
CCT GGAGGAAC ACTGACCACCAGTT GGAT GAGTCTCAAGGTCC TCCT CTAT GTGACAACC AT GTGAAT GGGGAG 
TGGTACCACTTCACGGGCATGGCGGGAGATGCCATGCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
r;- CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
y CAGCTGCTCAGATACCAGCGAGTGCACATGCGCTCCAGGAACTGTGCTAGGCCCTGACAGGCAGACATGCTTTG 
^!!;! ATGAAAATGAATGTGAGCAAAACAACGGTGGCTGCAGTGAGATCTGTGTGAACCTCAAAAACTCCTACCGCTGT 
f I GAGT GT GGGGT T GGCCGTGT GCT AAGAAGT GAT GGC AAGACTT GTGAAGAC GTT G AAGGAT GC C AC AAT AAC AA 

UJ 

TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 

,- '1 CTGAGGATAACCACACTTGCCAAGTCCCTGTGTTGTGCAAATCAAATGCCATTGAAGTGAACATCCCCAGGGAG 

Mr 

W CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
CCTCTTCTCTCTCAAGACATGTGGTACAGTGGTCGATGTGGTGAATGACAAGATTGTGGCCAGCAACCTCGTGA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
| fl ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 
r| CATGAGCCGAAATCATGGGATCTTCCCATTCACTCTGGAGATCTTCAAGGACAATGAGTTTGAAGAGCCTTACC 
[g GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
p TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 
|I| CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
C AC AAAAT CAGAAGCTGGGT AT AAT ATTTCAAGTT ACAAACCC TAGAAAAAT TAAACAGT TACT GAAAT T AT GA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
T T AT AGG GAAT T T G GAAGT GT AT C AAT AAAAC AGT AT AT AAT T T T 



APP ID-1 0063734 



Page 263 of 322 



FIGURE no 



MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVTSILKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAI EVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDVVNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPVVHV 
SGLESLVESC FAT PT S KI D E VL KY Y L I RDGC VS DD S VKQ Y T S RDHL AKH FQ VP V FK FVGKDH KE V FLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids .411-418, 443-451 

N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 ' 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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FIGURE ill 



GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCCTGCACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCTGCCTTGGGGTGACAATCTCAGCTCCAGGCTACAGGGAGACCGGGAGGATCACAGAGCCAGCATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
AT GG AG AC C T TC AG AAAGGT GGGGAT CC C CAT C AT C AT AGC ACT ACT G AG CCTGGC GAGT AT CATC AT TGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GG AAGC AGCT GT GT GACGGAGAGCT GGACT GT CCCT TG GG GGAGGACG AGGAGCACT GT GTC AAGAGC TT C 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
^ AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
!f GCTACAGCAGAGCTGTGGAGATTGGCCCAGACCAGGATCTGGATGTTGTTGAAATCACAGAAAACAGCCAG 

*5s GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 

y s 

jT 3 | TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
V| TC AGC ATCCAGT ACGAC AAACAGC ACGT CT GT GG AG GGAGC AT C C TGG AC CCCC AC T GGGTCCTC ACGGC A 

|y GCCC AC TGCT TC AGGAAAC ATACCGATG TGTT C AAC TGGAAGGT GCGG GC AGGC TC AGAC AAACT G GGC AG 

*ps CTTCCC ATCC CT GGC.T GT GGCCAAGAT C ATC ATC AT TG AATT C AACCC C ATGT ACC CCAAAG AC AATG AC A 

5. TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
^ GATG AGGAGC TC ACTC C AGCC ACCCC ACT C TGGAT C AT TGGAT GGGGCT T T ACGAAGC AGAAT GGAGGGAA 

G ATGTCTG AC AT ACT GCT GC AGGCGTCAGT CCAGGT CAT T GACAGC AC AC GG TGCAAT GCAGAC GATGCGT 
7? ACCAGGGGGAAGTCAC CG AGAAGAT GAT GT GT GC AGGC AT CCCGGAAGGGGGT G TGGAC ACCT GCC AGGGT 

Js| GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
ifsy CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 

GGAAGGCTGAGCTG TAAT GCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCCAAAGTCAGACACAGAGCAAGAGTCCCCTTGGGTACACCCCTCTGCCCACAGCCTCAGCAT 
TTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCCTCGCAGCCCAGAGGCGCCCAGAGGAAGTCA 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
CAGGGGTAT T GCT AAGCC AAGAAGGAACTT TCCC AC ACT ACT G AAT GGAAGC AGGC TGT CTT GT AAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GCCT AC T AG AGC AAGAAAC C AGTT GT AAT AT AAAAT GCACTGCCCT AC TGTT GGT AT GAC TACCGT T ACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 
AAAA 
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FIGURE 112 

MLQDPDSDQPLNSLDVKPLRKPRI PMETFRKVGI PI I IALLSLAS 1 1 1 WVLIKVILDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKS FPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVE IGPDQDLDWE I TENSQELRMRNS SGPCLSGSLVS LHCLACGKS L 
KTPRWGGEEASVDSWPWQVS IQYDKQHVCGGS ILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKL 
GSFPSLAVAKI I I IEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWI IGWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWG I VS WGYGCGGPS T PGVYTKVSAYLNWI YNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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FIGURE 113 



hi 



GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 
GAACGCCGGAGGAAAGCAAAGT GGCAGGGAAGGAACT TGTGCCAAAT TATGGGTCAGAAAAGATG 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGC TCTCAT CAGCCAGGCAGCAGCCAAAA 
fl TCTGGGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
|:|| CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
O GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 

ill 

GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGTT 
CAAATGATCTCCAAGGGCCCTTATACCCCAGGAGACTTTGATTTGAATTTGAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCTACTCGGGAGGCTGAGACAGGAGAATTACTTGAACCTGGGAGGTGAAGGAGGCTGAGACA 
GGAGAATCACTTCAGCCTGAGCAACACAGCGAGACTCTGTCTCAGAAAAAATAAAAAAAGAATTA 
T GGT T ATT TGTAA 
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FIGURE 114 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCG-FQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
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FIGURE lis 



CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACC ATG GCAA 
AGAAT CCTCCAGAGAAT T G TGAAGAC T GT CACAT TC TAAAT GCAGAAGC T T T TAAATC CAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
TCTACAGCAATGGAGAGAAGAAGAAGATT TACATGGAAATTGAT CCTGTGACCAGAAC TGAAATA 
TTCAGAAGCGGAAATGGCACTGATGAAACAT TGGAAGTGCACGACTT TAAAAACGGATACACTGG 
CATC TACT TCGTGGGTC T T CAAAAAT G TT T TAT CAAAAC T CAGAT T AAAGT GAT T CC TGAAT T TT 
U CT GAACCAGAAGAGGAAATAGAT GAGAAT GAAGAAAT TACCACAAC T T TC T T TGAACAG TCAGTG 

Q AT T T GGGT CCCAGCAGAAAAGCC T AT T GAAAACCGAGAT T T TCT TAAAAAT T CCAAAAT TC T GGA 

P GATTTGTGATAACGTGACCATGTATTGGATCAATCCCACTCTAATATCAGTTTCTGAGTTACAAG 
f I ACTTTGAGGAGGAGGGAGAAGATCTTCACTTTCCTGCCAACGAAAAAAAAGGGATTGAACAAAAT 
C| GAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA 
pj AGAACT T C CAATAAAT GAC TAT AC TGAAAATGGAAT AGAAT TT GAT CC CAT GCT GGATGAGAGAG 

«{!.* GTTATTGTTGTATTTACT.GCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
? GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
hJ TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 

y i 

TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 

T?s:.? 

jg GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
13 TTATCACCAAAAAAAAAAAAAAAAAAA 

5 If 
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FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HT FYSNGEKKKI YME I DPVTRTE I FRS GNGTDE TLE VHDFKNGYTG I Y FVGLQKC FIKTQ IKVI P 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGT GT TCCAGT AC GAAGGGCT CT GG AGGAGCT GCGTG AGGCAGAGT T C AGGCT TC ACCGAAT GCAGGCCC 
TATT TC ACCATCCTGG GACT TCCAGC CAT G CT GCAGGC AGTGCGAGCC CT GATG AT C GT AGGCATCGT CCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CC AAAG CCAACAT G AC ACT G ACCT CC GGGATCATGT TC ATT G TCTCAGGT CTTT GT GC AATT GCTGGAGTG 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
T C AC ACTAAT TGGGGGTGT G AT GATGTGC ATC G CCT GC CGGGGC CTGGCACCAGAAGAAACC AACT AC AAA 
GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
TGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATC 
CTTCCAAGCACGACTATGTGTAATGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCA 
CCCAAAAAACAAGGAGATCCCATCTAGATTTCTTCTTGCTTTTGACTCACAGCTGGAAGTTAGAAAAGCCT 
CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
GCTGAGTTATTTATGAATTAGAGGCTATAGCTCACATTTTCAATCCTCTATTTCTTTTTTTAAATATAACT 
T T CT AC TCTGAT GAGAGAAT GT GGTT TT AATC TCTC TCTCAC AT TT T GAT GATT TAGACAGACTCCCCCTC 
TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 
GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 
ACACTTACTGAAGAAGAAGCAATAAGAGAAAGATATTTGTAATCTCTCCAGCCCATGATCTCGGTTTTCTT 
AC AC TGTGAT CT T AAAAGT T AC CAAACC AAAG TC AT TT T C AG TT TGAGGC AACC AAACC T TT CT ACTG CT G 
T TGAC ATCT T CT TAT T ACAGCAACAC CAT T CTAG GAGT TT CC TG AGCT CT CCACTGG AGT CC TCTT TCTGT 
CGCGGGTCAGAAATTGTCCCTAGATGAATGAGAAAATTATTTTTTTTAATTTAAGTCCTAAATATAGTTAA 
AATAAATAAT GTTT T AGT AAAATGAT AC ACTATCTCTGTG AAAT AGCCT C AC CCCT AC AT GT GGAT AGAAG 
GAAATGAAAAAAT AAT TGCT T T GACATT GT CT AT AT GGT AC T TT GT AAAGTCAT GCT T AAGT AC AAAT TCC 
ATGAAAAGCT CAC AC C TGT AAT CCT AGC AC TT T GGG AGGCTG AGGAGGAAGGAT CACTT G AGCCCAGAAGT 
T CGAGACT AGCCTGGG CAAC AT GGAG AAGC CCT GT C TCT AC AAAAT AC AGAGAGAAAAAATC AGCC AGTC A 
T GGT GG CAT AC ACCT G T AGT CC CAGC ATT C CGGGAGGC T G AG GT GGGAGG AT CACT T GAGCCC AGG G AGGT 
TGGGGCTGCAGTGAGCCATGATCACACCACTGCACTCCAGCCAGGTGACATAGCGAGATCCTGTCTAAAAA 
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 
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FIGURE 118 



MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
Y FT I LGLPAMLQAVRALM I VG I VLGAI GLLVS I FALKC I RI GSME DS AKANMTLTS G IMFIVS GL 
CAIAGVSVFANMLVTNFWMS TANMYTGMGGMVQT VQTRYT FGAALFVGWVAGGLTL I GGVMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains : 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
Ui CAGATACTCGATACCTTCCCATCGCACAACCCAAAAAAGTTATCACACCGGAAAGAAGTCACCGAGCG 

5 n S 

TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
B ^ ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
% AATCTTAATTACAGGAACTGTGCATCAGCTATTTATGATTCTATAAGCTATTTCAGCAGAATGAGATA 
9 TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
Q CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 

ACGACATAGCATTATGTACATAGATGAGT GT AACAT T TATATCTCACATAGAGACAT GCT TAT ATGGT 
T TTAT TTAAAATGAAAT GCCAGTCCAT TACACTGAATAAAT AGAACTCAACTATTGCTTT TCAGGGAA 
ATCAT GGATAGGGTT GAAGAAGGT TACTAT TAATTGT TTAAAAACAGCTTAGGGATT AATGTCCTCCA 
TTTATAAT GAAGATTAAAA.T GAAGGCT TT AATCAGCATTGTAAAGGAAATT GAATGGCTT TCT GATAT 
GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 
AAAAT ATT TTT GT TT TT GTATTTGAAGAAGAAT GAT GCAT TTTGACAAGAAAT CATATAT GTATGGAT 
ATAT T T T AAT AAGTAT T T GAG TAC AGACT T T GAGGT T TCAT CAAT ATAAAT AAAAGAGC AGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TAT ACAT TTAT ATTAATAAAT TGTACATT TT T CTAAT T 



U 
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FIGURE 120 

MATHALEIAGLFLGGVGMVGTVAVTVMPQWRVSAFIENNIWFENFWEGLWMNCTOQANIRM 
IYDSLLALSPDLQAARGLMCA^SVMSFIAFMMAILGMKCTRCTGDNEKVKAHILLTAGIIFIITG 
MWLIPVSWVANAIIRDFYNSIVNVAQKRELGEALYLGWTTALVLIVGGALFCCVFCCNEKSSSY 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 



Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 



_ HI 
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FIGURE 121 

GGAGAGAGGCGCGCGGG.TGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCGGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGGTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT T CAAAGGAGAAAAGGGGGAATGTC TGAGGGAAAGC T T T GAGGAGT CC T GGACACCCAAC T AC 
AAGCAGT GTTCATGGAGT TCATTGAAT TATGGCATAGATCTTGGGAAAAT TGCGGAGTGTACATT 
TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
AT T GAAGC TAT AAT T TAT T TGGACC AAGGAAGCCC TGAAAT GAAT T CAACAAT TAAT AT TC AT CG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
T T GGCAC T T GT T CAGAT TACCCAAAAGGAGATGC T TC TAC TGGAT GGAAT T CAGT TT C TCGCATC 
ATTATTGAAGAACTACCAAA ATAAA TGCTTTAATTTTCATTTGCTACCTCTTTTTTTATTATGCC 
TTGGAAT GGT T CAC T TAAATGACAT T T TAAATAAGT T TATGTATACATC T GAATGAAAAGCAAAG 
CT AAATAT GT T TACAGACCAAAG T G T GAT T TCACAC T GT T T TT AAAT C.T AGCAT TAT TCAT T T TG 
C T T CAAT CAAAAGT GG T T TCAATAT T T T T T T TAG T TGGT TAGAATAC T T TC TT CATAG T CACAT T 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAAT AT AAAAGC TAC CAAT CT T T G TACAAT T TG TAAAT GT TAAGAATT T T T T TTATAT C TGT 
TAAAT AAAAAT TAT T TCCAACA 
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FIGURE 122 

MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 



Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
ATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAGAAATTGTAGGATCCAGTTTTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GC CCC TAAGCCAAAGCAAAAGAC C T AAG GACGACCT T TGAACAATACAAAGGATGGGT TT CAAT G 
TAAT TAGGCTAC TGAGCGGATCAGCT GT AGCACTGGT T ATAGC CCCCACT GT C T TAC TGACAAT G 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
l± ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
^ AT AACAGCC T T CAAAAAC T TAAGTAT AATCAAT T TAAAGGGCT CAAC CAGCTCACC T GGC TATAC 

O CTTGACCATAACCATATCAGCAATAT TGACGAAAATGCT TT TAATGGAATACGCAGACTCAAAGA 

CI GCTGATTCTTAGTTCCAATAGAATCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATT 
p| TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
||| CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
H AGACT GCC GCAACC T GGAAC TT TT GGACC T GGGATATAACCGGATCCGAAGT T TAGCCAGGAAT G 

y\ TCT TTGC TGGCATGAT CAGACT CAAAGAAC T T CACCT GGAGCACAAT CAAT T T T CCAAGCTCAAC 

W CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
■It CATAGGACAGACCATGTCCTGGACCTGGAGCTCCTTACAAAGGCTTGATTTATCAGGCAATGAGA 
TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
p* T CCAACAAGC T CACAT T TAT TGGT CAAGAGAT T TTGGAT TC TT GGATAT CCCT CAATGACATCAG 

TCTTGCTGGGAATATATGGGAATGCAGGAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
Ul TTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTG 
■P ATCGATGCAGT GAAGAACTACAGCAT CTGT GGCAAAAG TAC TACAGAGAGGT T TGAT CTGGC CAG 

m GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
S TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 

TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
hi CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGTCCCT7\AAGCAAATGACTCCCAGCACCCAGGAATTTTATGTA 
GAT TATAAACC CACCAACACGGAGAC CAGC GAGATGC TGC T GAATGGGAC GGGACCC T GCACC T A 
T AACAAAT CGGGCTCCAGGGAG T GT GAGG TATGAAC CAT TG TGAT AAAAAGAGC TCT T AAAAGC T 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GG T CAT TTTCCTCT C AT AC AT AAT CAAC C CAT T GAAAT T T AAAT AC C AC AAT CAAT G T GAAG C T T 
GAACT CCGGT TTAATATAATACCTAT TGTATAAGACCCT TTACTGAT T CCATTAATGTCGCATT T 
GTT TT AAGATAAAACT TC T T TCAT AGGTAAAAAAAAAAA 
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FIGURE 124. 

MGFNVI RLLS GSAVALVIAP TVLLTMLS SAERGCPKGCRCEGKMVYCES QKLQE I P S S I SAGCLG 

LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 

PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 

LARNVFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTW 

S GNE IEAFS GP S VFQCVPNLQRLNLDSNKLT FI GQE I LDS WI S LND I SLAGNI WECSRNI C S LVN 

WLKS FKGLRENT 1 1 CAS PKELQGVNVIDAVKNYS I CGKS TTERFDLARALPKPT FKPKLPRPKHE 

SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 

SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 4 65-4 68 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 



CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
ATTGAAGAGGAAGGC T TTC TAAAGCT T TGGCAAGGAGTGACACCCGC CAT T TACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
T TAGCCAATCCAAC T GACCTAGT GAAGGT T CAGATGCAAATGGAAGGAAAAAGGAAAC TGGAAGG 
AAAACCATTGCGATT TCGTGGT GTACATCATGCATTTGCAAAAATCT TAGCTGAAGGAGGAATAC 
GAGGGCTTTGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGATTTA 
ACCACTTATGATACAGTGAAACACTACTTGGTATTGAATACACCACTTGAGGACAATATCATGAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
T CAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGACT T T T GTATAAATCATCG 
ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
TGAGTGGAGTCAGTCCAT TT TAA 
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FIGURE 126 

MSVPEEEERLLPLTQRWPRASKFLLSGCAATVASLATFPLDLTKTRLQMQGEAALARLGDGARES 

APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHWYSGGRMVTYEHLREWFGKSEDEHYPLWKS 

VIGGMMAGVIGQFIiANPTDLVKVQMQMEGKRKLEGKPLRFRGVHHAFAKILAEGG 

NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 

RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCG'GTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAATGAAGAGGACATCGTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
Q AGGAGCGGAATG TGGACGATGGGCACATCAT TAACAT CAATAGCATG T CT GGCCACCGAGT G TTA 

0 CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 

GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 

ul 

* 5 AGACACAATTCGCCTTCAAACTCCACGACAAGGACCCTGAGAAGGCAGCTGCCACCTATGAGCAA 

1 - j ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 

CATCCAGATTGGAGACATCCAGATGAGGCCCACGGAGCAGGTGACCTAGTGACTGTGGGAGCTCC 
B TCCTTCCCTCCCCACGCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 

W. CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAAT T TGT T TGAGAT TTTTATA 

ill 

1^ TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 

y. 

|i| TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
Q TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
111 TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAGTGGATTTCATGGTGATCATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 



MARPGMERWRDRLALVTGASGG I GAAVARALVQQGLKWGCART VGNI EELAAECKS AGYPGTL I 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLLSGSTSGWKDMFNVNVLALSICTR 
EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase. 

amino acids 30-42, 104-114 
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FIGURE 12Q 

AACTTCTAC ATG GGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 

CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 

ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 

CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 

GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 

CCGT T GAGT GC T TGAAATGAGGAACT GAGAAAAT TAAT T T C TCATG TAT T T T TC T CAT T TATT TA . 

T TAATTT TTAAC TGATAGTTGTACATAT TTGGGGGTACATGTGATATT TGGATACATGTATACAA 

TATATAATGATCAAATCAGGGTAACT GGGATATCCATCACATCAAACATTTATT TT TTATTCTT T 

TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 

CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 

GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 

CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 

GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 

ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT . 

ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 

AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 

AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 

T T C CAT C CAT G T T G C T GCAAAT G AC AG GAT TTCGTTCT TAAT T T C AAT T AAAAT AAC C AC AC AT G 

GCAAAAA 
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FIGURE iso 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE i3i 



T T CT GAAGT AAC GG AAGCT ACCTT GT AT AAAG AC CT CAAC ACTGCT GAC CATGATC AGCG CAGC CT GGAGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAAT AC C AGAGG ATGCT ACAACTCT CT ACC T TC AGAAC AACC AAAT AAAT AAT GCT GGGATT CCT T CAGAT 
T T GAAAAACT TGCTGAAAGT AG AAAGAAT AT ACCT AT ACC ACAACAGTTT AGATGAAT TT CCT ACC AACCT 
C CCAAAGT AT GT AAAAGAGTTACATT TGC AAGAAAAT AAC AT AAGG AC TATC AC TT AT GATT C AC T TT C AA 
AAAT TC CCT ATC TGGAAG AAT T ACATTT AGATGACAAC T C TGT C TCT G C AGT T AGC ATAGAAGAGGGAGC A 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
M> GCCCAGGACT AT AGAAGAACT ACGCT TGGAT GAT AAT C GCAT ATCC AC TATTTC AT CACC AT C T CT TC AAG 

%J GTCT CACT AGTC TAAAACGCCT GGT T CT AG ATGGAAAC CT GT T G AAC AAT C AT GGT T T AGGT GACAAAGT T 

TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
TCCAGGC AC AAACCT GAGGAAG CT TT AT CT TCAAGAT AAC C ACATC AATCGGGT GCCCCC AAATGC TT TTT 
}i CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT 
j J: I GATG AT TT GG AC AAT AT AAC AC AACT GATT CTT C GC AACAAT CCCT G G T ATT GC GGGTGC AAGATG AAATG 

j!» G GTACGTG AC TG GT T AC AATC ACT ACCT GT GAAGGT CAAC GT GCGT GGGC TC ATGT GCCAAGCCCC AG AAA 

B AGGTTCGTGGGATGGCTATTAAGGATCTCAATGCAGAACTGTTTGATTGTAAGGACAGTGGGATTGTAAGC 
p ACCATT CAGAT AACCACT GCAATACC C AAC ACAGT GTATCCT GC C C AAGGA.C AGT G GCC AGC T CC AGT GAC 

Iff CAAACAGCCAGATAT T AAG AACCCC AAGCT CACT AAGG AT CAAC AAACCAC AGGGAGTCC CT C AAG AAAAA 

C| CAAT TAC AAT TACTGT GAAGTCTGT CACC TCT GAT ACCAT TC AT ATCT CT TGGAAACTT GCT CTACCT ATG 

Wl ACTGCTTTGAGACTCAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGT 

AACAGGGGAACGCAGTGAGTACTTGGTCACAGCCCTGGAGCCTGATTCACCCTATAAAGTATGCATGGTTC 
CCATGG AAAC CAGC AACC TCTACCT AT T TGAT GAAACT CCT GT T TGT ATT GAGACT GAAACT GCAC CCCTT 
CGAATGT AC AAC CCT AC AACCACCCT CAAT CG AG AG CAAGAGAAAGAACCT TAC AAAAAC CC CAAT TT ACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TTC AT AGG AATGGAT C GCT CTT CT CAAGGAACT GT GC AT ATAGC AAAGGG AGGAGAAGAAAGGAT GAC TAT 
GC AG AAGCT GGC ACT AAGAAGG AC AACT CT AT C C T GGAAATC AGGGAAAC TTCT TT T CAGAT GTT ACC AAT 
AAGCAATGAACCCATCTCGAAGGAGGAGTTTGTAATACACACCATATTTCCTCCTAATGGAATGAATCTGT 
AC AAAAACAATC AC AG TGAAAGC AGT AGT AAC CG AAGC T ACAGAGAC AGT GGT ATT CC AG ACTCAGAT CAC 
T C AC AC TCATGAT GCTGAAGGACT CACAGC AG AC TT GT G T TT TGGGT T TT TT AAACCTAAGGGAGGTG AT G 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVDdAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLPCVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNS VS AVS I EEGAFRDSNYLRLL FLSRNHLS T I PWGLPRT I EELRLDDNRI STISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFS YLRQLYRLDMSNNNLSNLPQGI FDDLDNI TQLI LRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAI KDLNAEL FDCKDS G I VS T I Q I TTAI PNT VYPAQGQWPAPVTKQPD 
IKNPKLTKDQQT TGSPSRKT I T I TVKSVTSDT IHI SWKLALPMTALRLSWLKLGHS PAFGS I TET 
ii IVTGERSEyLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
} PYKNPNLPLAAIIGGAVALVTIALIjALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospho lipase A2 a spar tic acid active site. 

amino acids 34-44 
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FIGURE iaa 

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCC T GC GGCCT CT CAGAC CTC TGCAGGAGCCT CAT GACCAAGCCCGGCT GCT C 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
G ACCAC T CCAAC AGAGCC AGGACT AT AT CAACC T C T TC TGCGC C AAC AT GAT GGAC T T GAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGATGCT TCGGGGAGCCTGATGC TGAAGATGAAGAATTAT CTAAAGCTATTCAA 
TAT CAGCAGC AT T T T T CGAGGAGAGT GAAGAGGC GAGAAAAACAATT T CCAGAT T CTCGC TC TGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
G T GGC TT CC TATACATCC TGGCAGAAT ACC CC C CAGCAAACAGAGAGC CACACC CATCCACACCG 
C CACCACCAAGCAGC CGC T GAGACGGAC GGT TC CAT GCCAGCT GCCTGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE laj. 

MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKAT IADLI LSALERATVFLEQRLPE INLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAA.IQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
S SWDYRSVPPYLANFYI FLVETGFHHVAHAGLELL I SRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
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FIGURE 135 

GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCT T ACAGAT T T GAT CCCGT T CGAGT GGAT AT CAC T T CGAAAGGAAAAAT GAGAGCAAGAT AT G 
T GAAT T AC AT CAAAAC AT C AGAGGT T GT C AGAC TGCCC TATCC T C T CC AAAT GAAATC T T CAGG T 
CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
CTGACATGAGACGGGAAATGGAGCAGTCAATGAATATGCTGAAT TCCAACCATGAGT TGCCTGAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT 
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGGTCAATATTGATGTCACTGAAT TAAT TACAGTGTCC TATAGAAAATGCCAT TAATAAATTAT 
ATG AAC T AC TAT AC AT TATGT ATAT TAAT TAAAACATCT TAAT CCAGAAAT C AAAAAAAAAAAAA 
AAAAAAAAAAAAAAA . 
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FIGURE 136 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVG FLKT DGS FWHD I PS GSYWEWS PAYRFDPVRVD I T SKGKMRARYVN YI KT SE 
WRLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPMVMM^ 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

GlycosaminoglyGan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
H AAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCTGAGATACGGGGCAGTG 
O TGCAAGCCAAAGATGGAAACAT TGACATCAGAATCTTAAGGAGGACT GAGT CTTTGCAAGACACA 

JJJ AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
?lg AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
S\ CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
W AAGAAATACAGCCAGATTCTGAGTCACTTTGAAAAGCTGGAACCTCAGGCAGCAGTTGTGAAGGC 
^ T T T GGGGGAAC TAGACAT T C T T CTGCAAT GGAT GGAGGAGACAGA ATAG GAG GAAAGT GATGC T G 

^ CTGCTAAG7VATATTCGAGGTCAAGAGCTCCAGTCTTCAATACCTGCAGAGGAGGCATGACCCCAA 
|n ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
Q CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
ill TGTAATAT CTT TCTGCTATTGGATATATT TAT TAGTTAATATATT TATTTAT TT TT TGCTAT TTA 

. AT GT AT T TAT T T T T T TAG T T GGACAT GAAAC T T T AAAAAAATT CACAGAT TAT AT T T AT AACC TG 
ACTAGAGCAGG T GATGTAT T T T TATACAG T AAAAAAAAAAAAC CT TGTAAAT TC T AGAAGAG TGG 
CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
ATTTGAAATTGAACCAATGACTACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAATCCTACACGGCCAGCATGTATT TCTACAAATAAAGT T TTCT TT GCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 
FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 
SLANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAVVKALGELDILLQWMEET 
E 

Important features off the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites . 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCT.CCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAA?U\AGGAGCATCTCACAGT 
T T T G TGGCTGAAGCAAAGC CACC TACAGT TAC TATGCC TC GAAT CAAGGCAT TGATGAC GGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
T AAGACAAG CAAAAGCAGC TGGAAAAAGAATAGTCT T T TATGGAGAT GAAACCTGGGT TAAAT TA 
T TCC CAAAGCAT T T TGT GGAATAT GAT GGAACAACC T CAT T T T T CGTGTCAGAT TACACAGAGGT 
GGATAATAATGTCACGAGGCATT TGGATAAAGTATTAAAAAGAGGAGATTGGGACATAT TAATCC 
H TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
Q AAG C T GAG C GAG AT GG AC AG C G T G C T GAT G AAGAT C C AC AC C T C AC T G C AG T C G AAGGAGAGAG A 

f% GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAA?ICAGGAAGTCACG 
fi\ GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
fl CCCGGTGATATCCGACATCCAAAGCACGTCCA ATAGA CGGATGTGGCTGCGACACTGGCGATAGC 
ly ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 
%\ CAAT GAGAGAGCAGT TGAGAT T T T TACAT T TGAATACAGT GCAGCT TAGTAAAC T G T TGCAAGAG 

I , | AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG 
*p GAACT GGATC AGAC TGTAC T TGGAGGAAAAGCAT TCAGAAGTC C TAT T CAAC C TGGGC TCCAAGG 

^ TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
* TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
S CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
|f| CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
^ GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
M AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
W TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
Q CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
m T CGGACAGCC T C CCAGCAGAGGTGT GGGAGCT GCAGC TGAGGGAAGAAGAGACAATC GGCC TGGA 

CAC T CAGGAGGG TCAAAAGGAGACT T GG T C GCACCAC TCATCC T GCCACCC CCAGAAT GCATCC T 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
AC T C AGAT C CAC AGAGC C C AG GAT C AAG G GAC C CAC T GCAG T G G C AGC AGGAC TGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CT GCACACAGTATG TAGT T ACCAAAAGAATAAACGGCAATAAT T GAGAAAAAAAA 
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FIGURE 140 



MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGWFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWD I L I LHYLGLDH I GH I S GPNS PL IGQKL SEMDS VLMK I HT S LQSKERE TPLP 
NLLVLCGDHGMS ETGS HGAS S TEEWTPL I L I S S AFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 



\i amino acids 58-76 

-si 

w 

•fa N-glycosylation sites. 

L. amino acids 56-60, 194-198 



N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 



O 275-281, 278-284 



Amidation site. 

amino acids 154-158 



Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 

GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
T CCC TAT AGAAAACAACT GCCAGCAC CT TAAGACCAC TCACAC C T T CAGAGT GAAGAAC TTAAAC 
C C G AAG AAAT T C AGC AT T CAT G AC C AG GAT CAC AAAG TAC TGGTCCTG GAC T C T GGGAAT C T CAT 
AGCAGTTCCAGATAAAAACTACATACGCCCAGAGATCTTCTTTGCATTAGCCTCATCCTTGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
T GT GACAAGGATAAAGGACAAAGTCATCCAT CC C T T CAGCTGAAGAAGGAGAAAC TGATGAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GTTGGGGTGACAGATAAATTTGAGAACAGGAAACACATTGAATTTTCATTTCAACCAGTTTGCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 
AT T T GAAC T AAT T G T AT AAAAACAC CAAAC C T GC T CAC T 



APP ID=10063734 



Page 295 



FIGURE 142 

MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALASSLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKL?\AQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNClSfEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 33-36 



N-rnyristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 143 



,::~n. 



■S. S 



CT AGAGAGTATAGGGCAGAAGGAT GGCAGAT GAGTGAC TCCACAT CCAGAGC TGCC TCCC T T TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGT GGAGT TTGC TGTCCACACATT CAAC CAACAGAGCAAGGAC TACTAT GCC TACAGAC T GGGG 
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 
GGGGAGAAC TAGG TG TGGGAAAT T T GAAGACGACAT T GAC AACT GC CAT T T CCAAGAAAGC AC AG 
AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
ACACAT C T T GAGCC TAATCAT GT AGTGTAGATC AT T AAAC AT CAGCAT T T T AAGAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



CO 
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FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNYMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE 14* 



CTGTGCAGCTCGAGGCTCCAGAGGCACACTCCAGAGAGAGCCAAGGTTCTGACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GAC GAGG GGC ATCAAGC ACAGAAT C AAG TGGAACC GGAAGGC CC T GC CCAGCAC TGCC CAGATC A 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACT T CGGAGC CGAGGGCAACAGGTACT AC GAGGCCAAC TACT GGCAG T T C CCCGATGGCATCCA 
C TAC AAC GGC T GC T C T GAGGC T AAT GTGACCAAGGAGGCAT T T GT CAC CGGCT GCATCAAT GCC A 
CCCAGGCGGC GAACCAGGGGGAG T T CCAGAAGCCAGACAACAAGC TC CACCAGCAGGT GC TC TGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
GAGAAGAACACATCAGGCACTGCGCCACCTGCTTCACAGTACTTCCCAACAACTCTTAGAGGTAG 
GT GT AT T CCC G T TT TACAGATAAGGAAAC T GAGGCCCAGAGAGC TGAAGT ACTGCACCC AGCAT C 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
C TCT AAT GAAAT TGTGAAAGC TC CATGTT TAGAAATAAAT GAAAACAC CTGA 



Hi 
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FIGURE 146 



MRKHLSWWWLAT VCMLL FS HLSAVQTRG I KHR I KWNRKAL PS TAQI TE AQVAENRPGAF I KQGRK 
LD I D FGAEGNRYYEANYWQFPDG I HYNGCSEANVTKEAFVTGC I NAT QAANQGE FQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 6 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 7 6-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 



APP ID= 
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FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
C AACT TT CAGAT TCAGGGGG TACATG TGAAGGT T TGT T T TATGAGTAT AT T GCATGATGC TGAGG 
TTTGGGGT 



m 

:- s 

4* 



U 

III 
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FIGURE 148 

MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 



APP ID=1 0063734 



Page 302 



FIGURE 14Q 



G TC T C C GC GT C AC AGGAAC T T CAGC AC C C AC AG GGC G GACAGC GCT C CC CT C T AC C T G GAGAC T T GAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACA7\AT CTCTACATAATAGAGAATTAAGTGCAGAAAGACCTT T GAAT GAACAGATTGCT GAA 
GCAGAAGAAGACAAGAT TAAAAAAACATATC CT CCAGAAAACAAGCCAGGT CAGAGCAACTAT T CTTT 
TGT T GATAACTTGAACCTGC TAAAGGCAATAACAGAAAAGGAAAAAAT T GAGAAAGAAAGACAAT CTA 
H TAAGAAGCTCCCCACT T GATAATAAGT TGAAT GTGGAAGATGT T GATT CAACCAAGAAT CGAAAACTG 

!!!? ATC GATGATTAT GACT CTACTAAGAGT GGATT GGAT CATAAAT T T CAAGAT GATCCAGAT GGT CT TCA 

fZ TCAACTAGACGGGACTCCTTTAACCGCTGAAGACATTGTCCATAAAATCGCTGCCAGGATTTATGAAG 
|j| ■ AAAA.T GACAGAGCCGT GTT T GACAAGATT GTT T CTAAAC TACT TAATCT CGGC CTTATCACAGAAAGC 

SJ CAAGCACATACACTGGAAGATGAAGTAGCAGAGGTT T TACAAAAATTAATCTCAAAGGAAGCCAACAA 

UJ TTAT GAGGAGGAT CCCAATAAGC CCACAAGCT GGACT GAGAATCAGGCT GGAAAAAT ACCAGAGAAAG 

s h TGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACAGTATCTAACACA 
!L T TAACCTT GACAAATGGCTT GGAAAGGAGAACTAAAACCTACAGT GAAGACAACT T T GAGGAACT CCA 

S ATATT TCC CAAAT TTCT ATGCGCTACT GAAAAGTAT T GAT T CAGAAAAAGAAGCAAAAGAGAAAGAAA. 

f*| CACT GATTACTATCATGAAAACACTGATT GAC T T T GT GAAGATGATGGT GAAATAT GGAACAATATCT 

pi CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 
Q . AGAAAAAAATGCTACT GACAATATAAGCAAGC TT T TCCCAGCACCAT CAGAGAAGAGTCATGAAGAAA 

I'll CAGACAGT ACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATATG GAAGCT TGAAGGAT T CCACAAAA 

GAT GATAACTCCAACC CAGGAGGAAAGAC AGAT GAACCCAAAGGAAAAACAGAAGCCTAT T TGGAAGC 
CATCAGAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT 
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGC CATCAAGC GCAT T TATAGCAGCCT G TAAA AAT GGCAAAAGAT CCAGGAGT CT TT CAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAA.CTAT CTGAAAGTAAAGT TGT AT GTAAGC T GAAAAAAAAAAAAAAAAAAAA 
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FIGURE lso 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYS FVDNLNLLKAI TEKEKI EKERQS IRSS PLDNKLNVEDVDSTKNRKLI DDYDS TKSGLDHK 
FQDDPDGLHQLDGT PLTAED I VHK I AARI YEENDRAVFDKI VSKLLNLGL I TE S QAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKT YSEDNFEELQYFPNFYALLKS I DSEKEAKEKETL I T IMKTLI DFVKMMVKYGT I S PEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDE PKGKTEAYLEAI RKNI EWLKKHDKKGNKEDYDLSKMRD FI NKQADAYVEKGI LDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 
amino acids 143-148, 239-244 
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FIGURE 151 



CGGCTCGAGGCTCCCGCCAGGAGT^AAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
AT AAT AACCAGC TTC TAGC TGGAGGGC T GCAT GCAGGGAAGGT CAT TAAAGG T GAAGAGAT CAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGAC TAGG GCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
y s CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
£b» CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
Jjj TAAT GGT AACT GACAAGT G TTACCC T GAGCCC CGCAGGCCAACCCAT C CC CAGT TGAGCC T TATA 

P GGG T CAGTAGCTCTCCACATGAAGT CC T G T CAC TCAC CAC TGT GCAGGAGAGGGAGGT GGT CATA 

||1 GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
hi TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
0 GAAAT GGC T CGAGC T CAGAAGAT AAAAGATAAG TAGGGTATGCT GATC.CT C T TT TAAAAACCCAA 

s "'\ GAT AC AAT C AAAAT C C C AGAT GCTGGTCTC TAT T C C C ATGAAAAAG T G C T CAT G AC AT AT T GAGA 

AGACC TACTT ACAAAGTGGCATATAT TGCAAT T TAT T T TAAT TAAAAGAT ACCTAT TTATATAT T 
JK TCT T TAT AGAAAAAAGT CTGGAAGAGT T TACT T CAAT TG TAGCAATG T CAGGGTGGTGGCAG TAT 

AGGT GAT TTTTCTTT TAAT TC T GT T AAT T TAT C TGTAT T TCCTAAT T T T T C TACAATGAAGAT GA 
ATT CC TT GTATAAAAATAAGAAAAGAAAT TAAT C T TGAGGTAAGCAGAGCAGACATCAT C T C TGA 
J"Jj TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
U I T TGTAGTAGT GATCAGGAAACAGAT CTCAGCAAAGCCAC T GAGGAGGAGGCT GTGC TGAGT T TGT 

p GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
ffl GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
pi ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
|fj GT C T C TGCAGAT GTAGT TAGT T AAGACAAGGT CATGC TGGATGAAGGTAGAC CTAAAT T CAATAT 

ft! GACT GGT T T CC T TG TATGAAAAGGAGAGGACACAGAGAC AGAGGAGACGCGGGGAAGAC TATGTA 

AAGATGAAGGCAGAGATCGGAGT T T T GCAG CCACAAGCTAAGAAACAC CAAGGAT TGTGGCAACC 
AT CAGAAGCT T GGAAGAGGCAAAGAAGAAT TC T TCCC TAGAGGCTT T AGAGGGATAACGGC TC T G 
CTGAAACCTTAATCTCAGACTTCCAGCCTCCTGAACGAAGAAAGAATAAATTTCGGCTGTTTTAA 
G CCACCAAGGATAAT TGGT TACAGCAGC TC TAGGAAAC TAATACAGC T GCT AAAATGAT CC C TG T 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGG CAGAAG TGATGGCATGCCAC T T CCAAGAT TAGG T TATAAAAGACACT GCAGC TTC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGCCACATTAGTGAACCTAGAAGCAGAGACTCTGTGAGATAATCGATGTTTGTTGTTTT 
AAG T TGC T CAG T T T T GGT CTAACT T GTT AT GCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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FIGURE 152 

MVLSGALC FRMKDSALKVLYLHNNQLLAGGLHAGKVIKGEE I SWPNRWLDASLS PVI LGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLT QLPENGGWNAP I TDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins . 

amino acids 8-29, 83-120, 95-134, 64-103 



APP ID=10063734 



Page 306 



FIGURE 153 



CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCAATGGCCGC 

CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 

CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 

TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 

CAACACAGACGTTCGTC TCAT T GGGGAGAAAC T GT TGCACGGAG TCAG TATGAG TGAGCGC TGCT 

ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 

CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 

TATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAAAAGC 

j£» T T GGAGAGAGT GGAGAGAT CAAAGCAAT T GGAGAAC T GGAT TTGCTGTT TAT GT C TCT GAGAAAT 

p| GCC T GCAT T TGA CCAGAGCAAAGC T GAAAAATGAATAAC TAACCCCC T TT CCC T GC TAGAAATAA 

kK!? 

||! CAAT T AGATGCCCCAAAGCGAT T TT T T T TAACCAAAAGGAAGATGGGAAG CCAAACTCCATCAT G 

W ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AGACCAGAAGGTAGAC T T T CTAAGCATAGATAT T TAT TGATAACAT T T CAT T GTAAC T GG TGT T C 

Ul 

w T AT AC AC AG AAAAC AAT T TAT T T T T T AAAT AAT TGTCTTTTTC C AT AAAAAAGAT T AC T T T C CAT 

$ TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 

P T G TAT T TAT TAT TAT TAT AAG AC T G CAT T T T AT T TAT AT CAT T T TAT T AAT AT G GAT T TAT T TAT 

in AGAAACAT CAT TCGATATTGCTACT TGAGTGTAAGGC TAATAT TGATATT TATGACAATAAT TAT 

%? AGAGCT AT AACATGT TTAT TT GACC T CAATAAACAC T TGGATAT CCC 

III 

fU 
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FIGURE lsj. 

MAALQKSVSSFLMGTIATSCLLLLALLVQGGAAAPISSHCRLDKSNFQQPYITNRTEMLAKEASL 
ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE iss 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGC TAG CCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCGTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCAC TT TC TAGATAT T TC CCCC T TGC TGGAGAAGAAAGAGCCCC TGGT T T TAT T 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
T TGACAT GGATGAT TC TGAGGAGGAAGCT GT TAT TGAATG TATAGAGAT T TATCCAAATAAATAT 
CTTTATTTAAAAATGAAAAA 
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FIGURE 156 



iSffl 

0 

in 

3.5 3 



H 5 



MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRW.STVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide : 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156 
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FIGURE 157 

CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACTTGAGGGACC TCCGAGTAGAACCTGTTACAAC TAGTGTTGCAACAGGGGAC TATTCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATTTGT GT GACGGGCAAAAGCAACT TCCAGTCC TACAGCTGTGTGAGGTGCAAT TACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGC TGAACACAG TC TAT T TCAT TGGGGC CCATAATAT TC C TAATGCAAAT ATGAATGAAGATGG 
CCCT TCCATGTCTGTGAAT T TCACCTCACCAGGC T GCCTAGACCACATAATGAAATATAAAAAAA 
Q AGT G T G TC AAGGG C GGAAGCC T GTGGGAT CCGAACATCAC T GC T T G T AAGAAGAATGAGGAGAC A 

fS GTAGAAGT GAAC T TCACAACCAC T CCCCT GGGAAACAGATACAT GGCT CT TAT CCAACACAGCAC 

|f! TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
III TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
y\ AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
M GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
s CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
[3 TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
111 TCACACAATTTGTTACTTCACTGAATTTCTTCAAAACCATTGCAGAAGTGAGGTCATCCTTGAAA 
— AGT GGCAGAAAAAGAAAATAGCAGAGATGGG TC CAG TGCAGTGGCT T GCCAC TCAAAAGAAGGCA 

fii 

5* GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
?l i GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 

? -5!? 

GT GAT C T AAGAAGC CAGAT TC AT CT GC ACAAATAC GT GG T GGT C TAC T T TAGAGAGAT TGAT AC A 
AAAGAC GAT T ACAATGC TC TCAGT GT C T GCCCCAAGTAC CACC T CAT GAAGGAT GC C AC TGC T T T 
CT GT GCAGAAC T T C T CCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGAT CACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 158 



MSLVLLSLARLCRS AVPREPTVQCGSETGPSPEWMLQHDLI PGDLRDLRVEPVTTS VATGDY S ILMNVSWV 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNEDGPSMSWFTSPGCLDHI^ 

STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLVVYPSEICFHHTICY FTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKWFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCS DLRSQ I HLHKY WVY FRE IDTKDDYNAL S VCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 -182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 

AGCCACCAGCGCA^CATGACAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 
ACTCAGCT GAAGAAG 
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FIGURE 160 



MTVKTLHGPA1WKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCS VS FQLEKVLVTVGCTCVT PVI HHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



AC AC T G G C CAAAC AAAAAC GAAAGCAC TCCGTGCTG GAAG T AG GAG GAG AG T C AGGAC T CC CAG G 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAG ATGC CTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACGTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCTGAAGATGAGGAAAAGTTTGGAGGAGCAGCTGACTCAGGGGTGGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCTT'CGAGGGTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
H G GAAC TCAACCACACACAGCAGC TGC CT GC CC TGCCC TGGC TCAACGT GTCAGCAGAT GGTGACA 

Q ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
5l GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
J* CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
}*] CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
111 CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
%j GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
|.) TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
4 :s CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
* GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
O CAAGACC T GC AGTCAGGC CAG T GT C T GCAGC TATGGGAC GATGACT T GGGAGCGC TAT GGGCC T G 

ffl CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
%l CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 

CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
CO CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
P CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
Sj CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
2t " GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACT TAAA TAAAGGCAGACGCTGTTTT TC TAAAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
L RCQ KE T DCDLC L RVAVHLAVH GHWEEPEDEEK FGG AADSG V EE P RNASLQAQ WL S FQ AY P TARC VLL E V 
QVPAALVQFGQSVGSWYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQS WLLDAPCSLPAE AALCWRAPGGDPCQ PL VP PLSWENVTVDKVLE FPLLKGH PNLCVQ 
VN S S E KLQLQ EC L WAD SLG P L KD D VL LL ET RG PQ DNRS LC AL EPSGCTSLPS KAST RAARLG E Y LLQ D LQ S 
GQCLQLWDDDLGALWACPMDKYIHKRWALVWLACLLFAAALSLILLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGVWLLFSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence : 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site . 

amino acids 583-586 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCGATGAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCCTGACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGTCTACAGCATCGAGTATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGT GCGGGAGGC CGG T CAGCCACCAAGATGAC TGACAGGT T CAGC TC T CTGCAGCACAC TAC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
T TC TACCACT TAGAGCTCCAGGTCAACCGCACCTACCAAATGCACC TTGGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACAGAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
y* CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
h TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
m AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 
CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
\ll GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
C| GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
, 1 CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
* CACCTTAGGCCTAAAGGTCAGGTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 
4* TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
5 CCC T GGGGAT T TGCACAGACAGAACATC TGACCCAAATGT GCTACACAGT GGGGAGGAAGGGACA 

pi CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
S CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
tj GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
ftj AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
PI ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
S AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
— AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
C AAG GC AGAAAT GAC AG T GCAAG GAG GAAAT G C AG GGAAAC T C C C GAG GT C C AGAGC C C CAC C T C 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGC TGGGCAGAACCAGAACAACCT GC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
T T CCCAGCCAGGGCAAC T GCC T GAC GT T GCAC GAT T TCAGC T T CAT T C CT C T GATAGAACAAAGC 
GAAATGCAG GTCCAC CAGGGAGGGAGACAC ACAAGCC T T T TC T GCAGGCAGGAGT T TCAGAC C CT 
AT CC TGAGAATGGGGT T T GAAAGGAAGG TGAGGGC T GTGGCCC CT GGACGGG TACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
AT CT GT AATGTGGGGAT CATAACACCT AC CT CATGGAGT T GTGGT GAAGAT GAAAT GAAG TCAT G 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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FIGURE 164 

MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175, 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 165 

TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAAC ATG G 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGT GACGAT GTATATGT C TAC T ACAAAC TGGCAGGGGGAT CC C T T GAAC T T TGGGC T GGA 
AGTGTTGAACACAGT TTTGGATATT TTCCAAAAGATT TGATCAAGGTACT TCATAAATACACGGA 
AGAAGAGCT ACATAT TCCAGCAGAT GAGACAGACT T TGTCTGCTTT GAAGGAGGAAGAGAT GAT T 
T TAATAGT TATAAT GTAGAAGAGCT T T TAGGAT CT T TGGAACT GGAGGAC T CTGTACC TGAAGAG 
TCGAAGAAAGCTGAAGAAGTTTCTCAGCACAGAGAGAAATCTCCTGAGGAGTCTCGGGGGCGTGA 
ACTT GACCCT GT GCC TGAGCCC GAGGCAT TCAGAGC TGAT TCAGAGGAT GGAGAAGGT GC T T T C T 
CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGAT GC T TACAAAGTCCT GAAAACAGAAATGAGTCAGAGAGGAAGT GGACAGT GCGT TAT T 
CATTACAGCAAAGGATTTCGTT GGCATCAAAATCTAAGT T TGTTTTACAAAGAT TGT T TT TAGTA 

CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 



MAAAPGLLFWLFVLGALWWPGQSDLSHGRRFSDLKVCGDEECSMmYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGS VEHS FGY FPKDL I KVLHKYTEEE LHI PADE TDFVC FEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMS QRGS GQCVI HYSKGFRWHQNLS L FYKDCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N -glycosylate on site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-76 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 

CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCC T T GAGAAT GAAGCAGAACAGAAGTTAATAGAGAGCATG TT GCAAAACC T GACAAAACCC GGG 
ACAGGGAT T TC TGAT GGT GATT T CT GGAT AGGGC TT T GGAGGAATGGAGAT GGGCAAACATC TGG 
TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
CAAGTAT GAAC CAGAGAT TAAT CCAACAGCCCC TGTAGAAAAGC C T TATC T TACAAATCAACCAG 
GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
GCATAAAAGT AAAGGAAGAACAAAAACT AGTC CAAACCAGT CT ACACT GT GGAT T TCAAAGAGTA 
C CAGAAAAGAAAGT GGCATGGAAGTATAATAAC T CAT TGAC TT GGT T C CAGAAT T T T GTAATTCT 
GGAT C TG TATAAGGAATGGCAT CAGAACAATAGC T T GGAATGGC T TGAAAT CACAAAGGATC TGC 
AAGAT GAAC T GTAAGC TCCCCC T T GAGGCAAATAT TAAAGTAAT T T T TATATGTC TAT TAT T TCA 
T T T AAAGAAT AT GC T G T GC T AAT AAT G GAG T GAGACAT GC T TAT T T T G C T AAAG GAT G C AC C C AA 
AC T T CAAAC T T CAAGCAAATGAAAT GGACAATGCAGATAAAGT T G T TAT CAACACGT C GGGAG T A 
TGTGTGTTAGAAGCAATTCCTTT TAT TTCTTTCACCTTTCATAAGTTGT TATC TAGTCAATGTAA 
TGTATATTGTAT TGAAAT TTACAGTGTGCAAAAG TAT TTTACCTTTGCATAAGTGTTTGATAAAA 
ATGAACTGT T C TAAT AT T TAT T T TTAT GGCAT C TCAT T T T TCAATACATGC T C T T T TGAT TAAAG 
AAAC TTAT TAC TGTTGTCAACTGAATTCACACACACACAAATATAGTACCATAGAAAAAGTT TGT 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AATAAGAAGC TATTTCATTAAGTGTGATATAAACCTCCTCAAACAT TT TACTTAGAGGCAAGGAT 
TGT CT AAT TT CAAT TGT GCAAGACAT GT GC CT TATAATTAT TT T T AGC T TAAAAT TAAACAGATT 
TT G TAAT AAT GTAAC T T T GTTAATAGGT GCATAAACAC TAATG CAGT CAAT TT GAACAAAAGAAG 
TGACATACACAATATAAATCATAT GT CT T CACACGT TGCC TATATAAT GAGAAGCAGC TC TC TGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
AT TGACACT GGAGGCAGATAGT T GCAAAGTTAGTC TAAGG T T T CCC TAGC T GTAT T TAGCC TCT G 
ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGCACAGCACACAGACAT TTTAGGAAGGAAAGGAAC TACGAAAT CGTGTGAAAATGGGT TGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 

MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
PYLTNQPGDTHQNWVTEAG 1 1 PNL I YWI PT I PLLLL I L VAFGTCC FQMLHKSKGRTKT S PNQ 
STLWISKST RKE S GME V 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites. 

amino acids 36-89, 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 266-269 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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